The Journal of Biological Chemistry

© 1997 by The American Society for Biochemistry and Molecular Biology, Inc. 



Vol. 272, No. 33, Issue of August 15, pp. 20456-20462, 1997 
Printed in U.S.A. 



Optimal Subsite Occupancy and Design of a Selective 
Inhibitor of Urokinase* 

(Received for publication, May 12, 1997) 

Song-Hua Ke‡, Gary S. Coombs§¶, Kathy Tachias‡, David R. Corey§‖, and Edwin L. Madison‡**

From the ‡Department of Vascular Biology, The Scripps Research Institute, La Jolla, California 92037 and the
§Department of Pharmacology and Howard Hughes Medical Institute, University of Texas Southwestern Medical
Center, Dallas, Texas 75235



Human urokinase type plasminogen activator (u-PA) 
is a member of the chymotrypsin family of serine pro- 
teases that can play important roles in both health and 
disease. We have used substrate phage display tech- 
niques to characterize the specificity of this enzyme in 
detail and to identify peptides that are cleaved 840-5300 
times more efficiently by u-PA than peptides containing 
the physiological target sequence of the enzyme. In ad- 
dition, unlike peptides containing the physiological tar- 
get sequence, the peptide substrates selected in this 
study were cleaved as much as 120 times more effi- 
ciently by u-PA than by tissue type plasminogen activa- 
tor (t-PA), an intimately related enzyme. Analysis of the 
selected peptide substrates strongly suggested that the 
primary sequence SGRSA, from position P3 to P2', represents
optimal subsite occupancy for substrates of
u-PA. Insights gained in these investigations were used 
to design a variant of plasminogen activator inhibitor 
type 1, the primary physiological inhibitor of both u-PA 
and t-PA, that inhibited u-PA approximately 70 times 
more rapidly than it inhibited t-PA. These observations 
provide a solid foundation for the design of highly se- 
lective, high affinity inhibitors of u-PA and, conse- 
quently, may facilitate the development of novel thera- 
peutic agents to inhibit the initiation and/or 
progression of selected human tumors. 



Local activation and aggregation of platelets, followed by 
initiation of the blood coagulation cascade, assure that a fibrin 
clot will form rapidly in response to vascular injury (1). The 
presence of this thrombus, however, must be transient if the 
damaged tissue is to be remodeled and normal blood flow 
restored. The fibrinolytic system, which accomplishes the en- 
zymatic degradation of fibrin, is therefore an essential compo- 
nent of the hemostatic system (1). The ultimate product of the 
fibrinolytic system is plasmin, a chymotrypsin family enzyme
with relatively broad, trypsin-like primary specificity that is 
directly responsible for the efficient degradation of a fibrin clot 
(2). Production of this mature proteolytic enzyme from the 



* This study was supported in part by National Institutes of Health 
(NIH) Grants R01 HL52475 and P01 HL31950 (to E. L. M.). The costs
of publication of this article were defrayed in part by the payment of 
page charges. This article must therefore be hereby marked "advertise- 
ment" in accordance with 18 U.S.C. Section 1734 solely to indicate this 
fact. 

¶ Supported by NIH Grant T32GM08203.

‖ An assistant investigator with the Howard Hughes Medical Institute.
To whom correspondence may be addressed: Dept. of Pharmacology
and Howard Hughes Medical Institute, University of Texas Southwestern
Medical Ctr., Dallas, TX 75235.

** To whom correspondence may be addressed: Dept. of Vascular 
Biology (VB-1), The Scripps Research Institute, 10550 N. Torrey Pines 
Rd., La Jolla, CA 92037. Fax: 619-784-7323.



inactive precursor, or zymogen, plasminogen is the rate-limit- 
ing step in the fibrinolytic cascade (2, 3). Catalysis of this key, 
regulatory reaction is tightly controlled in vivo and is mediated 
by two enzymes present in human plasma, u-PA¹ and t-PA (3-6).

u-PA and t-PA are very closely related members of the chymotrypsin
gene family. These two proteases possess extremely
high structural similarity (7, 8), share the same primary phys- 
iological substrate (plasminogen) and inhibitor (plasminogen 
activator inhibitor, type 1) (3), and, unlike plasmin, exhibit 
remarkably stringent substrate specificity (9-11). Despite
their striking similarities, the physiological roles of t-PA and 
u-PA are distinct (5, 6), and many studies (5, 6, 12-18) suggest 
selective inhibition of either enzyme might have beneficial 
therapeutic effects. Mice lacking t-PA, for example, are resist- 
ant to specific excitotoxins that cause extensive neurodegen- 
eration in wild type mice (13), and mice lacking u-PA exhibit 
defects in the proliferation and/or migration of smooth muscle 
cells in a model of restenosis following vascular injury (5, 6). 

A large body of experimental evidence from studies involving
both model systems and human patients suggests that u-PA 
may play an important role in tumor biology and provides a 
compelling rationale to pursue the development of u-PA inhib- 
itors. For example, anti-u-PA antibodies inhibit metastasis of 
HEp3 human carcinoma cells to chick embryo lymph nodes,
heart, and lung (19), and similar studies demonstrated that 
these antibodies inhibit lung metastasis in mice following in- 
jection of B16 melanoma cells into the tail vein (20). Anti-u-PA 
antibodies also inhibit both local invasiveness and lung metas- 
tasis in nude mice bearing subcutaneous MDA-MB-231 breast 
carcinoma tumors (21). In addition, a recent study indicated 
that u-PA-deficient mice are resistant to the induction and/or 
progression of several tumor types in a two-stage, chemical 
carcinogenesis model (18). Finally, high levels of tumor-associ- 
ated u-PA correlate strongly with both a shortened disease-free 
interval and poor survival in several different human cancers 
(22-24). 

Because mice lacking either u-PA or t-PA do not develop 
thrombotic disorders, selective inhibition of either of these two 
enzymes seems unlikely to create thrombotic complications in 
vivo. On the other hand, mice lacking both u-PA and t-PA 
suffer severe thrombosis in many organs and tissues, resulting 
in a significantly reduced life expectancy (5, 6). Nonselective 
inhibition of these two enzymes, therefore, seems almost cer- 
tain to produce catastrophic consequences in the clinical set- 
ting. Consequently, significant interest exists in the develop- 
ment of inhibitors that are stringently specific for either t-PA 



¹ The abbreviations used are: u-PA, urokinase type plasminogen activator;
t-PA, tissue type plasminogen activator; fTC, fAFF1-tether C;
HPLC, high pressure liquid chromatography; PAI-1, plasminogen activator
inhibitor type 1.






Optimal Subsite Occupancy for u-PA 






or u-PA, which are expected to facilitate a detailed investiga- 
tion of the precise roles of the two enzymes in several important 
pathological processes and may aid the development of novel 
therapeutic agents to combat these processes. Rational design 
of these selective inhibitors is greatly complicated, however, by 
the absence of obvious "lead compounds"; both their primary 
physiological substrate and inhibitors fail to discriminate be- 
tween the two closely related proteases. 

We have used substrate phage display (25, 26) to elucidate 
optimal subsite occupancy of u-PA. Peptide substrates that 
match the consensus sequence for substrates of u-PA derived 
from these studies are cleaved by u-PA 840-5300 times more 
efficiently than control peptides containing the physiological 
target sequence present in plasminogen. In addition, unlike the
plasminogen-derived control peptides, the selected peptides ex- 
hibit substantial selectivity for cleavage by u-PA versus t-PA. 
Information gained in these investigations was used to aug- 
ment the u-PA/t-PA selectivity of PAI-1, the physiological in- 
hibitor of both t-PA and u-PA (27, 28); suggests potential lead 
compounds for the design of selective, small molecule inhibitors 
of u-PA; and provides new insights into the divergent evolution 
of molecular recognition by intimately related enzymes. 

MATERIALS AND METHODS 
Reagents — Competent MC1061 (F⁻) Escherichia coli and nitrocellulose
were purchased from Bio-Rad Laboratories. Pansorbin (Protein
A-bearing Staphylococcus aureus) cells were obtained from Calbiochem.
K91 (F⁺) and MC1061 (F⁻) strains of E. coli were provided by Steve
Cwirla (Affymax). mAb 3-E7 was purchased from Gramsch Laboratories
(Schwabhausen, FRG). u-PA was obtained from Jack Henkin
(Abbott Laboratories).

Construction of the phage vector fAFF1-tether C (fTC) and the random
hexapeptide library fAFF-TC-LIB has been previously described
(26). Control substrate phage fTC-PL, which contained the physiological
target sequence for u-PA and t-PA, was constructed by hybridizing
the single-stranded oligonucleotides
5'-TCGAGCGGTGGATCCGGTACTGGTCGTACTGGTCATGCTCTGGTAC-3' and
5'-CGCCACCTAGGCCAGGACCAGCACAACAACCACGAGAC-3' and then ligating the
annealed, double-stranded products into the XhoI/KpnI-cut vector fTC. All
constructs were first transformed into MC1061 by electroporation and
then transferred into K91.

Measurement of Enzyme Concentrations — Concentrations of func- 
tional t-PA and u-PA were measured by active site titration with 
4-methylumbelliferyl p-guanidinobenzoate (29) using a Perkin-Elmer
LS 50B Luminescence Fluorometer as described previously (9, 30). In
addition, the enzymes were titrated with a standard PAI-1 preparation 
that had been previously titrated against a trypsin primary standard. 
Total enzyme concentrations were measured by enzyme-linked immu- 
nosorbent assay. 

Phage Selection Using u-PA — Substrate phage display was originally 
developed by Matthews and Wells (25) using monovalent phage, and an 
alternative method that used multivalent phage was reported later by 
Smith and Navre (26). Multivalent substrate phage were screened with 
u-PA using reaction conditions identical to those previously reported for 
t-PA (31) except that digestion of the phage was performed using 
enzyme concentrations varying from 2 to 10 μg/ml and incubation times
varying from 0.5 to 10 h. 

Dot Blot Assay of Phage Proteolysis— Phage precipitation and dot blot 
analysis were performed as described previously (26, 31). Individual 
phage stocks were prepared and digested with no enzyme, t-PA, u-PA, 
or u-PA in the presence of 1 mM amiloride, a specific inhibitor of u-PA, 
for periods of time varying from 15 min to 10 h. Individual reaction 
mixtures were spotted onto a nitrocellulose filter using a dot blotter 
apparatus (Bio-Rad). The filter was probed with mAb 3-E7 and developed
using the Amersham Western ECL kit. Loss of positive staining
indicates loss of antibody epitopes from the phage due to proteolytic 
cleavage of the randomized hexamer region. 

Preparation and Sequencing of DNA from Phage Clones — DNA samples
were prepared from interesting phage clones as described previously
(31). Briefly, phage were precipitated from a 1-ml overnight
culture by adding 200 μl of 20% polyethylene glycol in 2.5 M NaCl. The
mixture was incubated on ice for 30 min, and the phage pellet was
collected by microcentrifugation for 5 min. The phage were resuspended
in 40 μl of lysis buffer (10 mM Tris-HCl, pH 7.6, 0.1 mM EDTA, 0.5%
Triton X-100) and heated at 80 °C for 15 min. Single-stranded DNA was
purified by phenol extraction and ethanol precipitation and sequenced
by the dideoxy method.

Kinetics of Cleavage of Synthetic Peptides by t-PA and u-PA — Pep- 
tides were synthesized and purified as described (9). Kinetic data were 
obtained by incubating various concentrations of peptide with a con- 
stant enzyme concentration to achieve between 5 and 20% cleavage of 
the peptide in each reaction. For assays with u-PA, enzyme concentration
was either 815 or 635 nM. For assays with t-PA, enzyme concentration
was 700 nM. Peptide concentrations were chosen where possible
to surround Km and in all cases were between 0.5 and 32 mM. The buffer
used in these assays has been described (9). Reactions were stopped by 
the addition of trifluoroacetic acid to 0.33% or by freezing on dry ice. 
Cleavage of the 13- and 14-residue peptides was monitored by reverse 
phase HPLC as described (9). The 4-6-residue peptides were acylated 
at their amino termini and amidated at their carboxyl termini. Cleav- 
age of the 4-6-residue peptides was monitored by hydrophilic interac- 
tion HPLC chromatography (32) using a polyhydroxyaspartamine col- 
umn from PolyLC (Columbia, MD). Buffer A was 50 mM triethylamine 
phosphate in 10% acetonitrile, and buffer B was 10 mM triethylamine 
phosphate in 80% acetonitrile. Peptides were eluted by a gradient that 
was varied from 100% buffer B to 100% buffer A during a 13-min 
interval. The percentage of cleaved peptide was calculated by dividing 
the area under the product peaks by the total area under substrate and 
product peaks. For all peptides containing multiple basic residues, 
mass spectral analysis of products confirmed that cleavage occurred at 
a single site and identified the scissile bond. Data were interpreted by 
Eadie-Hofstee analysis. Errors were determined as described (33) and 
were <25%. 
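The Eadie-Hofstee analysis mentioned above can be sketched as follows. The transformation v = Vmax - Km (v/[S]) gives a straight line when v is plotted against v/[S], with slope -Km and intercept Vmax. The rates below are simulated, noise-free values generated from an assumed kcat of 2.83 s⁻¹ and Km of 603 μM at 815 nM enzyme; they are not measurements from this study, and the function names are illustrative only.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def eadie_hofstee(substrate_um, rate_um_per_s, enzyme_um):
    """Fit v = Vmax - Km * (v / [S]) and return (kcat, Km, kcat/Km).

    Concentrations are in micromolar and rates in micromolar per second,
    so kcat is in s^-1, Km in micromolar, and kcat/Km in M^-1 s^-1.
    """
    v_over_s = [v / s for v, s in zip(rate_um_per_s, substrate_um)]
    slope, intercept = linear_fit(v_over_s, rate_um_per_s)
    km_um = -slope                 # slope of v versus v/[S] is -Km
    kcat = intercept / enzyme_um   # intercept is Vmax = kcat * [E]
    return kcat, km_um, kcat / (km_um * 1e-6)


# Simulated Michaelis-Menten rates (assumed parameters, not measured data).
ENZYME_UM = 0.815
TRUE_KCAT, TRUE_KM_UM = 2.83, 603.0
substrate_um = [500.0, 1000.0, 2000.0, 4000.0, 8000.0]
rate_um_per_s = [
    TRUE_KCAT * ENZYME_UM * s / (TRUE_KM_UM + s) for s in substrate_um
]

kcat, km_um, efficiency = eadie_hofstee(substrate_um, rate_um_per_s, ENZYME_UM)
print(f"kcat = {kcat:.2f} s^-1, Km = {km_um:.0f} uM, "
      f"kcat/Km = {efficiency:.0f} M^-1 s^-1")
```

With real data the points scatter about the line, and because v appears on both axes the fit is sensitive to errors in the measured rates.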

Site-directed Mutagenesis and Construction of an Expression Vector 
Encoding a Recombinant Variant of PAI-1 — The expression vector 
pPAIST7HS was derived from the plasmid pBR322 and contained a 
full-length cDNA encoding human PAI-1 that was transcribed from a 
T7 gene 10 promoter (34). The 300-base pair SalI/BamHI fragment of
human PAI-1 was subcloned from pPAIST7HS into bacteriophage
M13mp18. Single-stranded DNA produced by the recombinant
M13mp18 constructs was used as a template for site-specific mutagenesis
according to the method of Zoller and Smith (35) as modified by
Kunkel (36). The mutagenic oligonucleotide had the sequence 
5' - CCACAGCTGTCATAGGCAGCGGCAAAAGCGCCCCCGAGGAGA- 
TC-3'. 

Following mutagenesis, single-stranded DNA corresponding to the
entire 300-base pair SalI-BamHI fragment was fully sequenced to ensure
the presence of the desired mutations and the absence of any
additional mutation. The 300-base pair SalI-BamHI double-stranded
DNA fragment from the mutated, replicative form DNA was used to
replace the corresponding fragment in pPAIST7HS to yield a full-length
cDNA encoding PAI-1/UK1, which contained the amino acid sequence
GSGKSA from the P4 to P2' position of the reactive center loop.

Expression and Purification of Recombinant Wild Type PAI-1 and the
Variant PAI-1/UK1 — Expression of wild type and the mutated variant
of PAI-1 was accomplished in the E. coli strain BL21[DE3]pLysS (Novagen),
which synthesizes T7 RNA polymerase in the presence of
isopropyl-1-thio-β-D-galactopyranoside. Bacterial cultures were grown at
37 °C with vigorous shaking to an A595 of 0.9-1.1, and
isopropyl-1-thio-β-D-galactopyranoside was added to a final concentration
of 1 mM to induce the synthesis of T7 RNA polymerase and the production
of PAI-1 proteins. Cultures were grown for an additional 1-2 h at 37 °C
and then shifted to 30 °C for 2-6 h.

Cells were pelleted by centrifugation at 8000 × g for 20 min at 4 °C
and resuspended in 40 ml of cold start buffer (20 mM sodium acetate,
200 mM NaCl and 0.01% Tween 20, pH 5.6). The cell suspension was
disrupted in a French pressure cell (Aminco), and cellular debris was
removed by ultracentrifugation for 25 min at 32,000 × g.

Purification of soluble, active PAI-1 was performed as described 
previously (37). PAI-1 containing supernatants were injected onto a 
XK-26 column (Pharmacia Biotech Inc.) packed with CM-50 Sephadex 
(Pharmacia). The column was washed with 5 column volumes of start 
buffer (20 mM sodium acetate, 200 mM NaCl, and 0.01% Tween 20, pH 
5.6), and PAI-1 proteins were eluted using a 0.2-1.8 M linear gradient of
NaCl in the same buffer. Peak fractions were collected, pooled, and 
concentrated using a Centriplus 30 concentrator (Amicon). Purified 
preparations were analyzed by activity measurements using standard, 
direct assays of t-PA, SDS-polyacrylamide gel electrophoresis, and 
measurement of optical density at 280 nm. 

Measurement of Active PAI-1 in Purified Preparations — A primary
standard of trypsin was prepared by active site titration using
p-nitrophenyl guanidinobenzoate HCl as described previously (38).
Concentrations of active molecules in purified preparations of wild type
or mutated PAI-1s were determined by titration of standardized trypsin as
described by Olson et al. (39) and by titration of standardized t-PA
preparations.

[Table I: randomized hexamer sequences present in the 91 isolated substrate phage clones; the sequence listing is not recoverable from this copy.]

Kinetic Analysis of the Inhibition of t-PA and u-PA by Recombinant
PAI-1 and PAI-1/UK1 — Second order rate constants (ki) for inhibition
of t-PA or u-PA were determined using pseudo-first order (ki < 2 × 10⁶)
or second order (ki > 2 × 10⁶) conditions. For each reaction, the
concentrations of enzyme and inhibitor were chosen to yield several data
points for which the residual enzymatic activity varied between 20 and
80% of the initial activity. Reaction conditions and data analysis for
pseudo-first order reactions were as described previously (40-43).

For second order reactions, equimolar concentrations of u-PA and 
PAI-1 were mixed directly in microtiter plate wells and preincubated at 
room temperature for periods of time varying from 0 to 30 min. Following
preincubation, the mixtures were quenched with an excess of neutralizing
anti-PAI-1 antibody (generously provided by Dr. David Loskutoff),
and residual enzymatic activity was measured using a
standard, indirect chromogenic assay. These indirect, chromogenic assays
were compared with control reactions containing no PAI-1 or to
which PAI-1 was added after preincubation and the addition of
anti-PAI-1 antibody, plasminogen, and Spec PL to the reaction mixture.
Data were analyzed by plotting the reciprocal of the residual enzyme 
concentration versus the time of preincubation. 
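The reciprocal plot described above follows from the rate law for an irreversible bimolecular reaction at equimolar concentrations: d[E]/dt = -ki [E]², so 1/[E] = 1/[E]₀ + ki t, and the slope of 1/[E] against time is ki. A minimal sketch, using simulated data with a hypothetical starting concentration of 0.2 nM and an assumed ki of 6.2 × 10⁶ M⁻¹ s⁻¹ (illustrative values, not the experimental protocol):

```python
def linear_fit(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def second_order_rate_constant(times_s, residual_enzyme_molar):
    """Slope of 1/[E] versus time, in M^-1 s^-1, for equimolar E and I."""
    reciprocal = [1.0 / e for e in residual_enzyme_molar]
    slope, _intercept = linear_fit(times_s, reciprocal)
    return slope


# Simulated residual enzyme concentrations (hypothetical inputs).
E0_MOLAR = 2.0e-10          # assumed starting concentration of enzyme and inhibitor
TRUE_KI = 6.2e6             # assumed rate constant used to generate the data
times_s = [0.0, 300.0, 600.0, 1200.0, 1800.0]   # 0 to 30 min
residual = [E0_MOLAR / (1.0 + TRUE_KI * E0_MOLAR * t) for t in times_s]

ki_fit = second_order_rate_constant(times_s, residual)
print(f"ki = {ki_fit:.2e} M^-1 s^-1")
```

The starting concentration was chosen so that the later time points fall within the 20-80% residual activity window described in the text.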

RESULTS 

Construction and Use of Substrate Phage Libraries — A poly- 
valent fd phage library that displayed random hexapeptide 



sequences and contained 2 X 10^ independent recombinants 
was prepared (25, 31). Each member of this library displayed 
an N-terminal extension from phage coat protein III that con- 
tained a randomized region of six amino acids, a six-residue 
linker sequence (SSGGSG), and the epitopes for mAbs 179 and
3-E7. Because u-PA did not digest the phage coat protein III 
sequence, the antibody epitopes, or the flexible linker sequence, 
the loss of antibody epitopes from the phage surface upon 
incubation with u-PA required cleavage of the randomized 
peptide insert. Incubation of the library with u-PA, followed by 
removal of phage retaining the antibody epitopes, therefore, 
accomplished a large enrichment of phage clones whose ran- 
dom hexamer sequence could be cleaved by u-PA. 

Analysis of Selected Phage Clones and Identification of a 
Consensus Sequence — Following five rounds of selection to enrich
and amplify phage that display sequences that are readily
cleaved by u-PA, 100 phage clones were identified as u-PA 
substrates. DNA sequencing of these clones revealed the pres- 
ence of 91 distinct hexamer sequences among the selected 
phage (Table I). As expected from the trypsin-like primary 
specificity of u-PA, each hexamer contained at least one basic 
residue, and 89 of the 91 hexamer sequences contained at least 
one arginine residue. 35 of the 91 substrate phage contained a 
single basic residue, and in 33 of these 35 cases the single basic
residue was an arginine. An additional 22 phage contained two
basic residues but only a single arginine. Alignment and analysis
of these hexamer sequences suggested that the consensus
sequence for optimal subsite occupancy for substrates of u-PA,
from P3 to P2', was SGR(S > R,K,A)X, where X represents a
variety of amino acid residues but was most often alanine,
glycine, serine, valine, or arginine.

[Table II: sequence of the randomized hexamer present in labile u-PA substrate phage; the table body is not recoverable from this copy.]

Analysis of these data was complicated by the fact that 
approximately 72% of the selected substrate phage contained 
an arginine in the first position of the randomized hexamer and 
therefore utilized the amino-terminal flanking residues, Ser- 
Gly, to occupy the P3 and P2 subsites. While these results left 
no doubt that the P3-P1 SGR sequence created by the fusion 
was a very favorable recognition site for u-PA, this use of 
flanking residues necessitated a particularly careful examina- 
tion of the P3 and P2 preferences of u-PA. Consequently, we 
altered our experimental protocol in two ways to address this 
issue. First, we isolated an unusually large collection of sub- 
strate phage (91 distinct substrates) to ensure that a reasona- 
ble number of these (23) would not utilize the flanking Ser-Gly 
to fill the P3 and P2 subsites. This allowed a meaningful 
comparison of the consensus sequence derived from the entire
library with that derived from the non-fusion phage and the
demonstration of good agreement between the two consensus 
sequences. Second, we performed a previously described dot 
blot analysis (26, 31) of the digestion of all 100 substrate phage 
by u-PA using a wide variety of stringencies of digestion. Al- 
though this semiquantitative assay cannot provide kinetic con- 
stants, it can provide an accurate rank ordering of the lability 
of the substrate phage clones. 

Under the most stringent conditions examined, 11 of the 100 
substrate phage, containing eight distinct randomized hex- 
amer sequences, proved to be particularly labile u-PA sub- 
strates (Table II). All eight of the most labile substrate phage 
contained the P3-P1 SGR motif, demonstrating that this se- 
quence is, in fact, a more labile u-PA site than related, selected 
sequences present in the library such as SSR, TAR, TSR, TTR, 
etc. This dot blot analysis also yielded additional information 
regarding the preferences of u-PA for the primed subsites.
While analysis of the entire substrate phage library failed to 
reveal a clear consensus at P1' and P2', the most labile substrate
phage displayed an obvious preference at both of these
positions. Five of the eight most labile phage contained a serine
residue at P1', and seven of these eight phage contained an
alanine residue at P2'. These observations strongly suggest 
that the primary sequence SGRSA, from P3 to P2', represents 
optimal subsite occupancy for substrates of u-PA. 
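Deriving a consensus from aligned sequences amounts to tallying residues at each subsite. The sketch below is only an illustration: it uses the P3-P2' residues of the four selected peptides listed in Table III (peptides II-V), not the full set of eight labile phage sequences, and the helper names are hypothetical.

```python
from collections import Counter

POSITIONS = ["P3", "P2", "P1", "P1'", "P2'"]

# P3-P2' residues of peptides II-V (Table III); an illustrative subset only.
aligned = ["SGRSA", "SGRNA", "SGRSA", "SGRKA"]


def position_counts(sequences):
    """Return one Counter of residues per aligned position."""
    return [Counter(column) for column in zip(*sequences)]


def consensus(sequences):
    """Most frequent residue at each position, joined into a string."""
    return "".join(
        counts.most_common(1)[0][0] for counts in position_counts(sequences)
    )


for name, counts in zip(POSITIONS, position_counts(aligned)):
    print(name, dict(counts))
print("consensus:", consensus(aligned))
```

A frequency tally of this kind ignores how labile each clone is, which is why the text combines it with the dot blot ranking.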

Kinetic Analysis of the Cleavage of Peptides Containing Se- 
quences Present in Selected Substrate Phage — Four peptides 
containing amino acid sequences present in the randomized 
hexamer region of the most labile phage were chosen for detailed
kinetic analysis (Table III) and compared with the hydrolysis
of a control peptide (I) containing the P3-P4' sequence
of plasminogen, a series of residues that fall within a
disulfide-linked loop in the native protein. All four of the selected
peptides were substantially improved substrates for u-PA, by factors
of 840-5300, compared with the control, plasminogen
peptide (Table III). These increases in catalytic efficiency were
mediated primarily by increases in kcat, suggesting that optimized
subsite interactions served to lower the energy of the
transition state rather than the ground state. For example,
compared with that of control peptide (I), the Km for cleavage of
the most labile selected peptide (II) was reduced by a factor of
5.6; however, the kcat was increased by a factor of more than
940. In addition, peptide substrates that interacted optimally
with the primary subsites of u-PA were selective for cleavage
by u-PA relative to t-PA. The four selected peptides (II-V), for
example, were cleaved 16-89 times more efficiently by u-PA
than by t-PA, and improvements in both Km and kcat contributed
to the preferential hydrolysis by u-PA.
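The fold changes quoted above can be checked from the kinetic constants listed for peptides I and II in Table III. This is a worked-arithmetic sketch; it assumes that Km is tabulated in micromolar and kcat in s⁻¹, which is the reading under which the tabulated kcat/Km values are self-consistent.

```python
def catalytic_efficiency(kcat_per_s, km_um):
    """kcat/Km in M^-1 s^-1, with Km supplied in micromolar."""
    return kcat_per_s / (km_um * 1e-6)


# Values for peptides I and II as listed in Table III (Km assumed to be in uM).
control_upa = catalytic_efficiency(0.003, 3400)    # peptide I with u-PA
selected_upa = catalytic_efficiency(2.83, 603)     # peptide II with u-PA
selected_tpa = catalytic_efficiency(0.305, 4080)   # peptide II with t-PA

km_drop = 3400 / 603            # reduction in Km, peptide I -> II
kcat_gain = 2.83 / 0.003        # increase in kcat, peptide I -> II
fold_improvement = selected_upa / control_upa
selectivity = selected_upa / selected_tpa

print(f"Km reduced {km_drop:.1f}-fold, kcat increased {kcat_gain:.0f}-fold")
print(f"kcat/Km improved {fold_improvement:.0f}-fold")
print(f"u-PA/t-PA selectivity of peptide II: {selectivity:.0f}")
```

The product of the Km and kcat factors reproduces the overall improvement in kcat/Km, which makes explicit that most of the gain comes from kcat.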

Minimization of the Selective Peptide Substrates — The ki- 
netic analysis described above was performed using substrate 
peptides that were 14 amino acids in length. To confirm that 
the specificity we observed was inherent in the selected 
hexapeptide sequences, we examined the kinetics of cleavage of 
short peptides containing only sequences found within selected 
hexapeptide sequences. Pentapeptide VII, for example, was 
cleaved by u-PA with a catalytic efficiency of 1200 M⁻¹ s⁻¹ and
exhibited a u-PA/t-PA selectivity of 20. The behavior of penta- 
mer VII in these assays, therefore, was very similar to that of 
peptide IV, a 14-mer that contains the same P3-P2' sequence 
as the pentamer. These observations indicate that appropriate 
occupancy of the P3-P2' subsites alone can create selective 
substrates for u-PA. 

Effect of Lysine versus Arginine at P1 — Differences at position
190 (chymotrypsin numbering system) between u-PA and
t-PA suggest that u-PA may exhibit decreased discrimination
between arginine and lysine at the P1 position of a substrate
compared with t-PA (44). Consistent with this hypothesis and
in contrast to the selected t-PA substrate library, the u-PA
library did include members that contained a P1 lysine. This
observation suggested that the u-PA/t-PA selectivity of a peptide
substrate might be enhanced by placement of lysine in the
P1 position, although this increased selectivity was likely to be
accompanied by decreased reactivity toward u-PA. To test this
hypothesis, we analyzed hydrolysis of a variant of u-PA selective
peptide (VI) that contained a P1 lysine (peptide VIII). The
P1 lysine mutation decreased the catalytic efficiency for cleavage
of this peptide by a factor of 49 for t-PA and by a factor of
7 for u-PA. As predicted, then, the P1 lysine mutation did
enhance the u-PA/t-PA selectivity of the peptide substrate by a
factor of approximately 7. It is not surprising, therefore, that
the most selective u-PA substrate, peptide IX, which is cleaved
approximately 121 times more efficiently by u-PA than by t-PA,
is derived from the randomized hexamer region of a substrate
phage that contained a P1 lysine.
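The selectivity gain from the P1 lysine is the ratio of the two efficiency losses. A short worked example using the Table III entries for peptides VI and VIII follows; the u-PA kcat/Km of peptide VIII is not legible in this copy, so it is computed here from the listed kcat and Km, assuming Km is in micromolar.

```python
# kcat/Km values in M^-1 s^-1 (Table III).
vi_upa, vi_tpa = 1100.0, 330.0        # peptide VI (P1 arginine)
viii_tpa = 6.8                         # peptide VIII (P1 lysine) with t-PA
viii_upa = 1.22 / (7900 * 1e-6)        # computed from kcat and Km (assumed uM)

upa_drop = vi_upa / viii_upa           # loss of efficiency toward u-PA
tpa_drop = vi_tpa / viii_tpa           # loss of efficiency toward t-PA
selectivity_gain = tpa_drop / upa_drop

print(f"u-PA: {upa_drop:.1f}-fold lower, t-PA: {tpa_drop:.1f}-fold lower")
print(f"u-PA/t-PA selectivity gain: {selectivity_gain:.1f}-fold")
```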

Importance of P3 and P4 for Discrimination between u-PA 
and t-PA — Recent investigations that explored optimal subsite 
occupancy for substrates of t-PA suggested that the P3 residue 
was the primary determinant of the ability of a substrate to 
discriminate between t-PA and u-PA and that this selectivity 
could be enhanced modestly by appropriate occupancy of P4 
(11). These suggestions were based on evidence obtained from
a statistical analysis of phage selected using a substrate sub- 
traction protocol rather than by a kinetic analysis of peptide 
substrates. Consequently, to test these hypotheses, we synthe- 
sized variants of the most labile u-PA-selective substrate (pep- 
tide II) that contained mutations in the P3 and/or P4 positions 
and analyzed the hydrolysis of these peptides by u-PA and 
t-PA. In peptide X the P3 serine of peptide II was replaced by
a tyrosine, and in peptide XI the P3 serine was replaced by
arginine. As expected, these mutations substantially decreased
the u-PA/t-PA selectivity of the peptide by a factor of 330 or
360, respectively, and actually converted the peptide into a
t-PA-selective substrate. Moreover, mutation of both the P3
serine and P4 glycine of the most labile u-PA substrate to
arginine and glutamine, respectively (peptide XII), decreased
the u-PA/t-PA selectivity by a factor of 1200. These data confirm
the proposed status of the P3 and P4 residues as specificity
determinants for substrates of t-PA and u-PA and suggest a
particularly prominent role of the P3 residue in this capacity.

Table III

                                    u-PA                              t-PA
Peptide  Sequence            kcat     Km      kcat/Km      kcat     Km      kcat/Km    u-PA/t-PA
                             (s⁻¹)    (μM)    (M⁻¹ s⁻¹)    (s⁻¹)    (μM)    (M⁻¹ s⁻¹)  selectivity

Native cleavage sequence from plasminogen
I        KKSPGR↓VVGGSVAH     0.003    3400    0.88         0.0043   15,000  0.29       3.0

u-PA-selective peptides
II       LGGSGR↓SANAILE      2.83     603     4700         0.305    4080    75         63
III      LGGSGR↓NAQVRLE      -        -       -            -        -       -          -
IV       LGGSGR↓SATRDLE      0.54     733     740          0.068    1500    45         16
V        LGGSGR↓KASLSLE      1.14     1130    1010         0.168    5100    -          31

Minimized, u-PA-selective peptides
VI       SGR↓S               2.3      2100    1100         5.0      15,000  330        -
VII      SGR↓SA              3.7      3100    1200         -        40,000  60         20
VIII     SGK↓S               1.22     7900    -            0.19     28,000  6.8        23
IX       GSGK↓S              0.82     4250    193          0.07     44,000  1.6        121

Variants of u-PA-selective peptides
X        -                   0.7      2200    318          3.29     1850    1800       0.018
XI       LGGRGR↓SANAILE      0.08     1200    67           0.85     2400    350        0.019
XII      LGQRGR↓SANAILE      0.068    1500    45           2.55     3000    850        0.005

↓, scissile bond between P1 and P1'. -, entry not legible in this copy.

Design and Characterization of a Variant of PAI-1 That Is
Selective for u-PA — To test the prediction that information
gained from the study of peptide substrates could facilitate the
design of selective, high affinity inhibitors of urokinase, we
sought to augment the u-PA/t-PA selectivity of the serpin
PAI-1, the primary physiological inhibitor of both t-PA and
u-PA. We used oligonucleotide-directed, site-specific mutagenesis
to construct a variant of PAI-1 that contained the primary
sequence found in the peptide substrate that was most selective
for u-PA, GSGKS, from the P4-P1' position of the reactive
center loop. Kinetic analysis indicated that the PAI-1 variant
inhibited u-PA approximately 70 times more rapidly than it
inhibited t-PA with second order rate constants for inhibition of
u-PA and t-PA of 6.2 × 10⁶ M⁻¹ s⁻¹ and 9 × 10⁴ M⁻¹ s⁻¹,
respectively (Table IV). In contrast, wild type PAI-1 inhibits
u-PA and t-PA with second order rate constants of 1.9 × 10⁷
M⁻¹ s⁻¹ and 1.8 × 10⁶ M⁻¹ s⁻¹, respectively. As anticipated,
therefore, the mutated serpin possessed a u-PA/t-PA selectivity
that was approximately 7-fold greater than that of wild type
PAI-1. Moreover, the 70-fold selectivity of the PAI-1 variant
is consistent with the value of 120 observed for hydrolysis of
the corresponding peptide substrate by the two enzymes (Tables
III and IV).
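The selectivity figures for the serpin follow directly from ratios of the second order rate constants. The sketch below assumes the rate constants read as 6.2 × 10⁶ and 9 × 10⁴ M⁻¹ s⁻¹ for the variant and 1.9 × 10⁷ and 1.8 × 10⁶ M⁻¹ s⁻¹ for wild type PAI-1; the exponents are an assumption, chosen because they reproduce the 70-fold and 7-fold values stated in the text.

```python
# Second order rate constants in M^-1 s^-1 (exponents assumed, see lead-in).
variant = {"u-PA": 6.2e6, "t-PA": 9.0e4}      # PAI-1/UK1
wild_type = {"u-PA": 1.9e7, "t-PA": 1.8e6}    # wild type PAI-1


def selectivity(rate_constants):
    """Ratio of the u-PA to the t-PA inhibition rate constant."""
    return rate_constants["u-PA"] / rate_constants["t-PA"]


variant_selectivity = selectivity(variant)
wild_type_selectivity = selectivity(wild_type)
enhancement = variant_selectivity / wild_type_selectivity

print(f"variant: {variant_selectivity:.0f}-fold, "
      f"wild type: {wild_type_selectivity:.1f}-fold, "
      f"enhancement: {enhancement:.1f}-fold")
```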

DISCUSSION 

Substrate Phage Can Elucidate Specificity Differences be- 
tween Closely Related Enzymes — u-PA and t-PA possess dis- 
tinct but overlapping physiological and pathological roles, and 
the ability to selectively inhibit either enzyme with small mol- 
ecules would allow these roles to be examined comprehensively 
both in vitro and in vivo. Normally, such inhibitor design would 
be based on knowledge of the sequences of endogenous protein 
substrates or inhibitors. This approach, however, is not possible 
with u-PA and t-PA because these enzymes share the same 
physiological substrate, plasminogen, and inhibitor, PAI-1. 
Furthermore, this similarity calls into question the hypothesis 
that highly selective inhibitors can be generated, since the 
specificities of the two enzymes appear so similar. We find, 
however, both in this study and in a previous study aimed at 
the design of t-PA selective substrates (11), that there are 
subtle but significant differences in optimal subsite occupancy 
between the two enzymes, and these distinctions can be eluci- 
dated by substrate phage display protocols. 

Sequences Selected for Optimal Cleavage Do Not Resemble 
the Physiological Target Sequence — A key observation of this 
study is that the primary sequence SGRSA, from the P3-P2' 
positions of a peptide substrate, affords highly labile subsite 
occupancy for urokinase. This sequence differs at P3, P1', and 
P2' from the target sequence found in plasminogen (PGRVV) 
and is cleaved by u-PA greater than 5300 times more effi- 
ciently. This major discrepancy, in both primary sequence and 
lability, of the physiological target sequence and the consensus 
sequence derived using substrate phage display protocols sug- 
gests that a physiological target sequence is not necessarily a 
reasonable lead compound for the design of specific, small 
molecule substrates or inhibitors of highly selective serine 
proteases. 

A major contribution to the discrepancy between the physi- 
ological and consensus target sequences of u-PA almost cer- 
tainly arises from the highly conserved mechanism of zymogen 
activation of chymotrypsin family enzymes (45). Following activation 
cleavage of a chymotrypsinogen-like zymogen, the P1' 
and P2' residues insert into the activation pocket, where they 
form a number of conserved hydrophobic interactions as well as 
a new, buried salt bridge with the aspartic acid residue adja- 
cent to the active site serine (45-47). Because these interac- 
tions substantially stabilize the active conformation of the ma- 
ture enzyme, this key role after activation cleavage places 
severe functional constraints on the P1' and P2' residues of a 
chymotrypsinogen-like zymogen and consequently prevents 
the two residues from evolving simply to interact optimally 
with the activating enzyme. Consistent with this hypothesis, 
the consensus and physiological target sequences for u-PA 
agree well on the unprimed side of the scissile bond; however, 
the two target sequences diverge dramatically at the P1' and 
P2' subsites. 

Additional factors are also likely to contribute to the observed 
discrepancy between the consensus and physiological 
target sequences for u-PA. For example, modeling studies re- 
ported by Lamba, Huber, Bode and co-workers (8) suggest the 
S1' and/or the S2' pockets utilized by u-PA when hydrolyzing 
plasminogen may actually differ from those used when hydro- 
lyzing peptide substrates. Moreover, as the enzyme diverged 
from a trypsin-like precursor, u-PA may have evolved a strong 
dependence for efficient catalysis upon productive interactions 
with substrates at secondary sites that diminished the contri- 
bution of optimal interactions with primary subsites in the 
active site cleft. Although the location, role, and even the ex- 
istence of such secondary contacts between u-PA and plasmin- 
ogen remain obscure at the present time, previous studies of 
the interaction of u-PA and t-PA with PAI-1 have demonstrated 
very clearly that these two enzymes are capable of using spe- 
cific, secondary contacts efficiently both to enhance selectivity 
and to dampen the influence of optimal primary subsite inter- 
actions (42, 48-50). Although the reactive center loop of PAI-1 
has evolved to match optimal subsite occupancy for urokinase 
very closely, in the absence of productive contact with a single, 
strong secondary site of interaction between the two proteins, 
PAI-1 becomes a poor inhibitor of u-PA (50). 

Implications Regarding the Possibility of Additional Physio- 
logical Substrates for u-PA — The identification of synthetic 
peptides that are cleaved up to 120 times more efficiently by 
u-PA than by t-PA raises the possibility that similar u-PA- 
selective (or t-PA-selective) physiological substrates may exist 
that are currently not appreciated. Differences in the pheno- 
types exhibited by mice lacking either of the two enzymes are 
consistent with this possibility (5, 6). This issue remains un- 
certain, however, because selective expression of t-PA or u-PA 
in particular microenvironments could also account for these 
distinct phenotypes. 

Importance of the P3 Residue in Discriminating between 
u-PA and t-PA — By demonstrating that mutation of the P3 
residue alone could alter the relative u-PA/t-PA selectivity of a 
peptide substrate by a factor of greater than 300 (Table III), 
this study provided strong support for the hypothesis that the 
P3 residue was the primary determinant of the ability of a 
substrate to discriminate between u-PA and t-PA. We have 
previously reported that occupancy of P3 by arginine or large 
aromatic or hydrophobic residues favored cleavage by t-PA 
(11), and this investigation showed that a P3 serine residue 
favored cleavage by u-PA. In addition, this study demonstrated 
that more modest alterations of specificity could be achieved by 
selective occupancy of the P4 and P1 subsites. These data 
indicated that PAI-1, which contains a P3 serine, has evolved to 
match optimal subsite occupancy of u-PA more closely than 
that of t-PA. This observation may explain why PAI-1 inhibits 
u-PA more rapidly than it inhibits t-PA (Table IV) and suggests 
that, during the evolution of the fibrinolytic system, there may 
have been a greater need to suppress the activity of u-PA in the 
circulation than to regulate t-PA activity. Consistent with this 
hypothesis, the circulating, single chain form of u-PA is a true 
zymogen, while t-PA is secreted into the circulation as an 
active, single chain enzyme. 

Substrate Phage Display Can Aid Inhibitor Design — Another 
implication of these studies is that information gained from the 
application of substrate phage display libraries can lead directly 
to the design of specific inhibitors. Although hydrolysis of 
the selective, small peptide substrates by u-PA is characterized 
by Km values in the 0.6-3 mM range, it has been routinely 
observed that the introduction of a transition state bond geometry 
adjacent to the P1 residue of a protease substrate can 
create either a reversible inhibitor whose affinity for the target 
protease is enhanced by 3-6 orders of magnitude or an irre- 
versible inhibitor with an impressive second order rate con- 
stant for inhibition of the target protease (>10^ s~') (for a 
review, see Ref. 51). Similar results using the substrates iden- 
tified in this study would create highly selective, small mole- 
cule u-PA inhibitors, with affinities in the low nanomolar 
range, that might be further improved by subsequent, system- 
atic chemical modification. 
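
The affinity estimate in this paragraph is an order-of-magnitude calculation. The sketch below works through it, under the assumption that the 0.6-3 mM values are Km values and that Km is used as a rough stand-in for substrate affinity; the function name is hypothetical.

```python
def enhanced_affinity(km_molar: float, orders_of_magnitude: int) -> float:
    """Divide a substrate Km by 10**orders to estimate a transition state analog's affinity."""
    return km_molar / 10 ** orders_of_magnitude


KM_LOW, KM_HIGH = 0.6e-3, 3e-3  # molar; range quoted for the selective peptide substrates

strongest = enhanced_affinity(KM_LOW, 6)   # 6 orders of magnitude: about 0.6 nM
weakest = enhanced_affinity(KM_HIGH, 3)    # 3 orders of magnitude: about 3 uM

print(f"estimated affinity range: {strongest * 1e9:.1f} nM to {weakest * 1e6:.1f} uM")
```

Only the upper end of the quoted 3-6 orders of magnitude enhancement brings these substrates into the low nanomolar range, which is consistent with the cautious wording of the text.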

Conclusion— The ability to identify subtle but significant 
specificity differences between enzymes that share the same 
physiological substrates and inhibitors, as demonstrated in 
this study, is a fundamental challenge both for basic enzymology 
and rational drug design. Advances in this area will significantly 
enhance understanding of the molecular determinants 
and mechanisms of specific catalysis and may facilitate 
the design of highly selective and therapeutically valuable new 
enzymes. 

Acknowledgments — We thank Steven Madden, Bikash C. Pramanik, 
and Dr. Clive Slaughter for peptide synthesis and mass spectral 

REFERENCES 

1. Roberts, H. R., and Tabares, A. H. (1995) in Molecular Basis of Thrombosis and Hemostasis (High, K. A., Roberts, H. R., eds) pp. 35-50, Marcel Dekker, New York 
2. Castellino, F. J. (1995) in Molecular Basis of Thrombosis and Hemostasis (High, K. A., Roberts, H. R., eds) pp. 495-515, Marcel Dekker, New York 
3. Collen, D., and Lijnen, H. R. (1991) Blood 78, 3114-3124 
4. Madison, E. L. (1994) Fibrinolysis 8, Suppl. 1, 221-236 
5. Carmeliet, P., Schoonjans, L., Kieckens, L., Ream, B., Degen, J., Bronson, R., De Vos, R., van den Oord, J. J., Collen, D., and Mulligan, R. C. (1994) Nature 368, 419-424 
6. Carmeliet, P., and Collen, D. (1996) Fibrinolysis 10, 195-213 
7. Spraggon, G., Phillips, C., Nowak, U. K., Ponting, C. P., Saunders, D., Dobson, C. M., Stuart, D. I., and Jones, E. Y. (1995) Structure 3, 681-691 
8. Lamba, D., Bauer, M., Huber, R., Fischer, S., Rudolph, R., Kohnert, U., and Bode, W. (1996) J. Mol. Biol. 258, 117-135 
9. Madison, E. L., Coombs, G. S., and Corey, D. R. (1995) J. Biol. Chem. 270, 7558-7562 
10. Coombs, G. S., Dang, A. T., Madison, E. L., and Corey, D. R. (1996) J. Biol. Chem. 271, 4461-4467 
11. Ke, S.-H., Coombs, G. S., Tachias, K., Navre, M., Corey, D. R., and Madison, E. L. (1997) J. Biol. Chem. 272, 16603-16609 
12. Danø, K., Andreasen, P. A., Grøndahl-Hansen, J., Kristensen, P., Nielsen, L. S., and Skriver, L. (1985) Adv. Cancer Res. 44, 139-266 
13. Tsirka, S. E., Gualandris, A., Amaral, D. G., and Strickland, S. (1995) Nature 377, 340-344 
14. Fukudome, K., and Esmon, C. T. (1994) J. Biol. Chem. 269, 26486-26491 
15. Min, H. Y., Doyle, L. V., Vitt, C. L., Zandonella, C. L., Stratton-Thomas, J. R., Shuman, M. A., and Rosenberg, S. (1996) Cancer Res. 56, 2428-2433 
16. Seeds, N. W., Williams, B. L., and Bickford, P. C. (1995) Science 270, 1992-1994 
17. Ossowski, L. (1988) Cell 52, 321-328 
18. Shapiro, R. L., Duquette, J. G., Roses, D. P., Nunes, I., Harris, M. N., Kamino, H., Wilson, E. L., and Rifkin, D. B. (1996) Cancer Res. 56, 3597-3604 
19. Ossowski, L., and Reich, E. (1983) Cell 35, 611-619 
20. Hearing, V. J., Law, L. W., Corti, A., Appella, E., and Blasi, F. (1988) Cancer Res. 48, 1270-1278 
21. Brünner, N., Høyer-Hansen, G., Rømer, J., Ellis, V., Holst-Hansen, C., Spang-Thomsen, M., and Danø, K. (1992) Proc. Am. Assoc. Cancer Res. 33, 61 
22. Duffy, M. J., O'Grady, P., Devaney, D., O'Siorain, L., Fennelly, J. J., and Lijnen, H. J. (1988) Cancer 62, 531-533 
23. Jänicke, F., Schmitt, M., Hafter, R., Hollrieder, A., Babic, R., Ulm, K., Gössner, W., and Graeff, H. (1990) Fibrinolysis 4, 69-78 
24. Duffy, M. J. (1993) Fibrinolysis 7, 295-302 
25. Matthews, D. J., and Wells, J. A. (1993) Science 260, 1113-1117 
26. Smith, M. M., Shi, L., and Navre, M. (1995) J. Biol. Chem. 270, 6440-6449 
27. Loskutoff, D. J. (1993) J. Clin. Invest. 92, 2563 
28. van Meijer, M., and Pannekoek, H. (1996) Fibrinolysis 9, 263-276 
29. Jameson, G. W., Roberts, D. V., Adams, R. W., Kyle, S. A., and Ellmore, D. T. (1973) Biochem. J. 131, 107-117 
30. Urano, T., Urano, S., and Castellino, F. J. (1988) Biochem. Biophys. Res. Commun. 150, 45-51 
31. Ding, L., Coombs, G. S., Strandberg, L., Navre, M., Corey, D. R., and Madison, E. L. (1995) Proc. Natl. Acad. Sci. U. S. A. 92, 7627-7631 
32. Alpert, A. J. (1990) J. Chromatogr. 499, 177-196 
33. Taylor, J. R. (1982) An Introduction to Error Analysis: The Study of Uncertainties in Physical Measurements, University Science Books, Mill Valley, CA 
34. Tucker, H. M., Mottonen, J., Goldsmith, E. J., and Gerard, R. D. (1995) Nat. Struct. Biol. 2, 442-445 
35. Zoller, M. J., and Smith, M. (1984) DNA 3, 479-488 
36. Kunkel, T. A. (1985) Proc. Natl. Acad. Sci. U. S. A. 82, 488-492 
37. Sancho, E., Tonge, D. W., Hockney, R. C., and Booth, N. A. (1994) Eur. J. Biochem. 224, 125-134 
38. Chase, T., and Shaw, E. (1967) Biochem. Biophys. Res. Commun. 29, 508-514 
39. Olson, S. T., Bock, P. E., Kvassman, J., Shore, J. D., Lawrence, D. A., Ginsburg, D., and Björk, I. (1995) J. Biol. Chem. 270, 30007-30017 
40. Holmes, W. E., Lijnen, H. R., and Collen, D. (1987) Biochemistry 26, 5133-5140 
41. Beatty, K., Bieth, J., and Travis, J. (1980) J. Biol. Chem. 255, 3931-3934 
42. Madison, E. L., Goldsmith, E. J., Gerard, R. D., Gething, M. J., Sambrook, J. F., and Bassel-Duby, R. S. (1990) Proc. Natl. Acad. Sci. U. S. A. 87, 3530-3533 
43. Madison, E. L., and Sambrook, J. F. (1993) Methods Enzymol. 223, 249-271 
44. Bode, W., Turk, D., and Karshikov, A. (1992) Protein Sci. 1, 426-471 
45. Huber, R., and Bode, W. (1978) Acc. Chem. Res. 11, 114-122 
46. Freer, S. T., Kraut, J., Robertus, J. D., Wright, H. T., and Xuong, N. H. (1970) Biochemistry 9, 1997-2009 
47. Fehlhammer, H., Bode, W., and Huber, R. (1977) J. Mol. Biol. 111, 415-438 
48. Madison, E. L., Goldsmith, E. J., Gerard, R. D., Gething, M.-J., and Sambrook, J. F. (1989) Nature 339, 721-724 
49. Madison, E. L., Goldsmith, E. J., Gething, M. J., Sambrook, J. F., and Gerard, R. D. (1990) J. Biol. Chem. 265, 21423-21426 
50. Adams, D. S., Griffin, L. A., Nachajko, W. R., Reddy, V. B., and Wei, C.-M. (1991) J. Biol. Chem. 266, 8476-8482 
51. Baggio, R., Shi, Y.-Q., Wu, Y.-Q., and Abeles, R. H. (1996) Biochemistry 35, 



The Journal of Biological Chemistry Vol. 272, No. 26, Issue of June 27, pp. 16603-16609, 1997 

© 1997 by The American Society for Biochemistry and Molecular Biology, Inc. Printed in U.S.A. 

Distinguishing the Specificities of Closely Related Proteases 

ROLE OF P3 IN SUBSTRATE AND INHIBITOR DISCRIMINATION BETWEEN TISSUE-TYPE PLASMINOGEN 
ACTIVATOR AND UROKINASE* 

(Received for publication, November 19, 1996, and in revised form, April 16, 1997) 

Song-Hua Ke‡, Gary S. Coombs§, Kathy Tachias‡, Marc Navre¶, David R. Corey§‖, and 
Edwin L. Madison‡** 

From the ‡Department of Vascular Biology, The Scripps Research Institute, La Jolla, California 92037, the §Department 
of Pharmacology and Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas 
75235, and ¶Affymax Research Institute, Santa Clara, California 95051 

Elucidating subtle specificity differences between 
closely related enzymes is a fundamental challenge for 
both enzymology and drug design. We have addressed 
this issue for two intimately related serine proteases, 
tissue-type plasminogen activator (t-PA) and urokinase- 
type plasminogen activator (u-PA), by modifying the 
technique of substrate phage display to create substrate 
subtraction libraries. Characterization of individual 
members of the substrate subtraction library accom- 
plished the rapid, direct identification of small, highly 
selective substrates for t-PA. Comparison of the amino 
acid sequences of these selective substrates with the 
consensus sequence for optimal substrates for t-PA, de- 
rived using standard substrate phage display protocols, 
suggested that the P3 and P4 residues are the primary 
determinants of the ability of a substrate to discrimi- 
nate between t-PA and u-PA. Mutagenesis of the P3 and 
P4 residues of plasminogen activator inhibitor type 1, 
the primary physiological inhibitor of both t-PA and 
u-PA, confirmed this prediction and indicated a predom- 
inant role for the P3 residue. Appropriate replacement 
of both the P3 and P4 residues enhanced the t-PA spec- 
ificity of plasminogen activator inhibitor type 1 by a 
factor of 600, and mutation of the P3 residue alone in- 
creased this selectivity by a factor of 170. These results 
demonstrate that the combination of substrate phage 
display and substrate subtraction methods can be used 
to discover specificity differences between very closely 
related enzymes and that this information can be uti- 
lized to create highly selective inhibitors. 



The chymotrypsin family of serine proteases has evolved to 
include members with both widely divergent and intimately 
related substrate specificities (1). We chose two members of 
this family, tissue-type plasminogen activator (t-PA)¹ and 
urokinase (u-PA), to test the hypothesis that small molecule 
libraries could be used to identify substrates that discriminate 
between closely related enzymes. This choice of enzymes assured 
a rigorous test of the hypothesis because t-PA and u-PA 
possess an extremely high degree of structural similarity (2, 3), 
share the same primary physiological substrate (plasminogen) 
and inhibitors (plasminogen activator inhibitor types 1 and 2) 
(4, 5), and exhibit restricted substrate specificity (6-8). 

* This study was funded in part by National Institutes of Health 
Grants R01 HL52475 and P01 HL31950 (to E. L. M.). The costs of 
publication of this article were defrayed in part by the payment of page 
charges. This article must therefore be hereby marked "advertisement" 
in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. 

‖ An assistant investigator with the Howard Hughes Medical Institute. 
To whom correspondence may be addressed: Dept. of Pharmacology 
and Howard Hughes Medical Institute, University of Texas Southwestern 
Medical Center, Dallas, TX 75235. 

** To whom correspondence may be addressed: Dept. of Vascular 
Biology (VB-1), The Scripps Research Institute, La Jolla, CA 92037. 
Fax: 619-784-7323. 

¹ The abbreviations used are: t-PA, tissue-type plasminogen activator; 
u-PA, urokinase-type plasminogen activator; PAI-1, plasminogen 
activator inhibitor type 1; mAb, monoclonal antibody; fTC, fAFF1-tether C; 
HPLC, high pressure liquid chromatography. 

Despite their striking similarities, the physiological roles of 
t-PA and u-PA are distinct (9), and many studies (10-16), 
including several that utilize transgenic mice (9, 11, 16), sug- 
gest that selective inhibition of either enzyme might have ben- 
eficial therapeutic effects. Mice lacking t-PA, for example, are 
resistant to specific excitotoxins that cause extensive neurodegeneration 
in wild type mice (11), and mice lacking u-PA exhibit 
defects in the proliferation and/or migration of smooth 
muscle cells in a model of restenosis following vascular injury 
(9). u-PA-deficient mice are also resistant to the induction 
and/or progression of several tumor types in a two-stage, chem- 
ical carcinogenesis model (16). 

Because mice lacking either t-PA or u-PA do not develop 
thrombotic disorders, selective inhibition of either of these two 
enzymes seems unlikely to create thrombotic complications in 
vivo. On the other hand, mice lacking both t-PA and u-PA 
suffer severe thrombosis in many organs and tissues, resulting 
in a significantly reduced life expectancy (9). Nonselective in- 
hibition of these two enzymes, therefore, seems almost certain 
to produce catastrophic consequences in the clinical setting. 
Consequently, significant interest exists in the development of 
inhibitors that are stringently specific for either t-PA or u-PA. 
Rational design of these selective inhibitors is greatly compli- 
cated, however, by the absence of obvious "lead compounds"; 
both their primary physiological substrate and inhibitors fail to 
discriminate between the two closely related proteases. 

We have previously described the use of substrate phage 
display, a strategy originally developed by Matthews and Wells 
(17), to elucidate optimal subsite occupancy for substrates of 
t-PA and to isolate peptide substrates that were cleaved as 
much as 5300 times more efficiently by t-PA than peptides 
containing the primary sequence of the actual target site pres- 
ent in plasminogen (18). Four of these selected substrates, 
including the most labile t-PA substrate, were chosen for de- 
tailed characterization; consequently, small peptides contain- 
ing the four selected amino acid sequences were synthesized, 
and a kinetic analysis of the cleavage of these peptides by t-PA 
was performed (18). Subsequent analysis of these four selected 
substrates demonstrated that they were also efficiently cleaved 
by u-PA and therefore were not selective for t-PA versus u-PA. 
Consequently, to facilitate the rapid isolation of peptide substrates 
that were cleaved at greater catalytic efficiencies by 
t-PA than by u-PA, we developed a novel protocol to prepare 
phage "substrate subtraction libraries" (Fig. 1). Characterization 
of individual members of the substrate subtraction library 
quickly accomplished the direct identification of highly labile, 
small peptide substrates that were preferentially cleaved by 
t-PA. In addition, insights gained during the analysis of these 
selective substrates were used to design a variant of the serpin 
PAI-1 whose selectivity toward t-PA was enhanced by a factor 
of 600. 

MATERIALS AND METHODS 

Reagents — Competent MC1061 (F⁻) Escherichia coli and nitrocellulose 
were purchased from Bio-Rad. Pansorbin (Protein A-bearing 
Staphylococcus aureus) cells were obtained from Calbiochem. K91 (F⁺) 
and MC1061 (F⁻) strains of E. coli were provided by Steve Cwirla 
(Affymax). mAb 179, which recognizes the epitope (ACLEPYTACD) of 
the human placental alkaline phosphatase protein with subnanomolar 
affinity, was provided by Ron Barrett (Affymax). mAb 3-E7 was pur- 
chased from Gramsch Laboratories (Schwabhausen, FRG). t-PA was 
obtained from Bruce Keyt (Genentech), and u-PA was obtained from 
Jack Henkin (Abbott Laboratories). 

Construction of the phage vector fAFF1-tether C (fTC) and the random 
hexapeptide library fAFF-TC-LIB has been previously described 
(19). Control substrate phage fTC-PL was constructed by hybridizing 
the single-stranded oligonucleotides 
5'-TCGAGCGGTGGATCCGGTACTGGTCGTACTGGTCATGCTCTGGTAC-3' and 
5'-CGCCACCTAGGCCAGGACCAGCACAACAACCACGAGAC-3' and then ligating the 
annealed, double-stranded products into the XhoI/KpnI-cut vector fTC. All 
constructs were first transformed into MC1061 by electroporation and 
then transferred into K91. 

Measurement of Enzyme Concentrations — Concentrations of func- 
tional t-PA and u-PA were measured by active site titration with 
4-methylumbelliferyl p-guanidinobenzoate (20) using a Perkin-Elmer 
LS 50B luminescence fluorometer as described previously (7, 21). In 
addition, the enzymes were titrated with a standard PAI-1 preparation 
that had been previously titrated against a trypsin primary standard. 
Total enzyme concentrations were measured by enzyme-linked immu- 
nosorbent assay. 

Phage Selection Using t-PA or u-PA — Conventional substrate phage 
display was originally developed by Matthews and Wells (17) using 
monovalent phage, and another method that used multivalent phage 
was reported later (19). Multivalent substrate phage were screened 
with t-PA using reaction conditions reported previously (18). Identical 
reaction conditions were used to screen the phage library with u-PA 
except that digestion of the phage was performed using enzyme concentrations 
varying from 5 to 10 μg/ml and incubation times varying 
from 1 to 10 h. 

Preparation of Substrate Subtraction Libraries — The initial random 
hexapeptide library fAFF-TC-LIB was subjected to three rounds of high 
stringency screening with t-PA (18) to prepare an intermediate library 
containing phage whose randomized hexamer sequences were digested 
efficiently by t-PA. The intermediate library was then amplified and 
screened, at low stringency, with u-PA. Following digestion of the 
intermediate library with u-PA, mAb 3-E7 and immobilized protein A 
were added to the mixture, and the resulting ternary complexes were 
pelleted by centrifugation as described previously (18, 19). In contrast 
to all previous screening steps, however, we retained the precipitated 
ternary complexes and discarded the supernatant, which contained 
phage that were digested by u-PA. The precipitated ternary complexes 
were washed four times with 50 mM Tris (pH 7.5), 0.1 M NaCl, 1 mM 
EDTA, 0.01% Tween 80, resuspended, and digested with 50 μg/ml t-PA 
at 37 °C for 4 h. The remaining ternary complexes were precipitated by 
centrifugation and discarded, and the supernatant, which contained 
t-PA-selective phage, was retained. Phage in the final supernatant were 
amplified overnight in E. coli K91 cells. Following amplification, indi- 
vidual phage clones were functionally characterized using a dot blot 
assay and analyzed by DNA sequencing. 
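
The logic of the subtraction protocol can be pictured as three successive filters applied to the phage pool. The sketch below is a toy model only: the `Phage` class, the lability scores, and the numeric thresholds are all hypothetical illustrations and were not measured or used in the study. High stringency is modeled as a high lability threshold for cleavage, and low stringency as a low one.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Phage:
    name: str
    tpa_lability: float  # hypothetical relative cleavage efficiency by t-PA
    upa_lability: float  # hypothetical relative cleavage efficiency by u-PA


def substrate_subtraction(library, high=0.8, low=0.2, final=0.5):
    """Return phage that survive the three steps of the toy subtraction scheme."""
    pool = list(library)
    # Step 1: three rounds of high stringency t-PA screening; keep cleaved phage.
    for _ in range(3):
        pool = [p for p in pool if p.tpa_lability >= high]
    # Step 2: low stringency u-PA digestion; keep the UNCLEAVED phage
    # (the precipitated ternary complexes), discard the supernatant.
    pool = [p for p in pool if p.upa_lability < low]
    # Step 3: digest the retained complexes with t-PA; keep released phage.
    return [p for p in pool if p.tpa_lability >= final]


library = [
    Phage("A", 0.90, 0.05),  # labile to t-PA, resistant to u-PA
    Phage("B", 0.90, 0.60),  # labile to both enzymes
    Phage("C", 0.30, 0.05),  # poor substrate for both enzymes
    Phage("D", 0.95, 0.25),  # moderate u-PA substrate, removed in step 2
]
selected = [p.name for p in substrate_subtraction(library)]
print(selected)
```

The key inversion is in step 2, where the precipitate rather than the supernatant is carried forward, so that even moderate u-PA substrates are subtracted from the pool.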

Dot Blot Assay of Phage Proteolysis — Phage precipitation and dot blot 
analysis were performed as described previously (18, 19). Individual 
phage stocks were prepared and digested with no enzyme, t-PA, u-PA, 
or u-PA in the presence of 1 mM amiloride, a specific inhibitor of u-PA, 
for periods of time varying from 15 min to 10 h. Individual reaction 
mixtures were spotted onto a nitrocellulose filter using a dot blotter 
apparatus (Bio-Rad). The filter was probed with mAb 3-E7 and developed 
using the Amersham Western ECL kit. Loss of positive staining 
indicates loss of antibody epitopes from the phage due to proteolytic 
cleavage of the randomized hexamer region. 

Preparation and Sequencing of DNA from Phage Clones — DNA samples 
were prepared from interesting phage clones as described previously 
(18). Briefly, phage were precipitated from a 1-ml overnight culture 
by adding 200 μl of 20% polyethylene glycol in 2.5 M NaCl. The mixture 
was incubated on ice for 30 min, and the phage pellet was collected by 
microcentrifugation for 5 min. The phage were resuspended in 40 μl of 
lysis buffer (10 mM Tris-HCl, pH 7.6, 0.1 mM EDTA, 0.5% Triton X-100) 
and heated at 80 °C for 15 min. Single-stranded DNA was purified by 
phenol extraction and ethanol precipitation. One-third of the single- 
stranded DNA was used for dideoxy sequencing. 

Kinetics of Cleavage of Synthetic Peptides by t-PA and u-PA— Pep- 
tides were synthesized and purified as described (7). Kinetic data were 
obtained by incubating various concentrations of peptide with a con- 
stant enzyme concentration to achieve between 5 and 20% cleavage of 
the peptide in each reaction. For assays with u-PA, enzyme concentration 
was either 815 or 635 nM. For assays with t-PA, enzyme concentration 
was 700 nM. Peptide concentrations were chosen where possible 
to surround Km and in all cases were between 1 and 32 mM. The buffer 
used in these assays has been described (7). Reactions were stopped by 
the addition of trifluoroacetic acid to 0.33% or by freezing on dry ice. 
Cleavage of the 13- and 14-residue peptides was monitored by reverse 
phase HPLC as described (7). The 4-6-residue peptides were acylated 
at their amino termini and amidated at their carboxyl termini. Cleav- 
age of the 4-6-residue peptides was monitored by hydrophilic interac- 
tion HPLC chromatography (22) using a polyhydroxyaspartamine col- 
umn from PolyLC (Columbia, MD). Buffer A was 50 mM triethylamine 
phosphate in 10% acetonitrile, and buffer B was 10 mM triethylamine 
phosphate in 80% acetonitrile. Peptides were eluted by a gradient that 
was varied from 100% buffer B to 100% buffer A during a 13-min 
interval. The percentage of cleaved peptide was calculated by dividing 
the area under the product peaks by the total area under substrate and 
product peaks. For all peptides containing multiple basic residues, 
mass spectral analysis of products confirmed that cleavage occurred at 
a single site and identified the scissile bond. Data were interpreted by 
Eadie-Hofstee analysis. Errors were determined as described (23) and 
were <25%. 
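
For readers unfamiliar with Eadie-Hofstee analysis, the method rearranges the Michaelis-Menten equation to v = Vmax - Km (v/[S]), so that a plot of v against v/[S] is linear with slope -Km and intercept Vmax. The sketch below is a minimal illustration using ordinary least squares on noise-free, hypothetical data; the kinetic constants and the function name are assumptions chosen for the example, not values from this study. Only the substrate range (1-32 mM) and the 700 nM enzyme concentration are taken from the text.

```python
def eadie_hofstee(substrate_conc, rates):
    """Fit v = Vmax - Km * (v / [S]) by least squares; return (Km, Vmax)."""
    x = [v / s for v, s in zip(rates, substrate_conc)]  # v/[S]
    y = list(rates)                                      # v
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return -slope, intercept


# Hypothetical constants used only to generate example data.
KM_TRUE = 2.0     # mM
VMAX_TRUE = 0.5   # mM/min

substrate = [1.0, 2.0, 4.0, 8.0, 16.0, 32.0]  # mM, spanning the assay range
velocity = [VMAX_TRUE * s / (KM_TRUE + s) for s in substrate]

km_fit, vmax_fit = eadie_hofstee(substrate, velocity)
enzyme_mM = 700e-6                  # 700 nM expressed in mM
kcat_per_min = vmax_fit / enzyme_mM

print(f"Km = {km_fit:.2f} mM, Vmax = {vmax_fit:.2f} mM/min, kcat = {kcat_per_min:.0f} min^-1")
```

Choosing substrate concentrations on both sides of Km, as described above, spreads the points along the Eadie-Hofstee line and makes the slope estimate more reliable.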

Site-directed Mutagenesis and Construction of Expression Vectors 
Encoding Variants of PAI-1— The expression vector pPAIST7HS was 
derived from the plasmid pBR322 and contains a full-length cDNA 
encoding human PAI-1 that is transcribed from a T7 gene 10 promoter 
(24). The 300-base pair SalI/BamHI fragment of human PAI-1 was 
subcloned from pPAIST7HS into bacteriophage M13mp18. Single-stranded 
DNA produced by the recombinant M13mp18 constructs was 
used as a template for site-specific mutagenesis according to the method 
of Zoller and Smith (25) as modified by Kunkel (26). 

Following mutagenesis, single-stranded DNA corresponding to the 
entire 300-base pair SalI-BamHI fragment was fully sequenced to ensure 
the presence of the desired mutation and the absence of any 
additional mutation. The 300-base pair SalI-BamHI double-stranded 
DNA fragments from mutated, replicative form DNAs were used to 
replace the corresponding fragment in pPAIST7HS to yield full-length 
cDNAs encoding PAI/P3R, PAI/P4Q, and PAI/P4Q,P3R. 

Expression and Purification of Recombinant PAI-1 Variants — Ex- 
pression of wild type and mutated variants of PAI-1 was accomplished 
in the E. coli strain BL21[DE3]pLysS (Novagen), which synthesizes T7 
RNA polymerase in the presence of isopropyl-1-thio-β-D-galactopyranoside. 
Bacterial cultures were grown at 37 °C with vigorous shaking to 
an A595 of 1.1-1.3, and isopropyl-1-thio-β-D-galactopyranoside was 
added to a final concentration of 1 mM to induce the synthesis of 
T7 RNA polymerase and the production of PAI-1 proteins. Cultures 
were grown for an additional 1-2 h at 37 °C and then shifted to 30 °C 
for 2-6 h. 

Cells were pelleted by centrifugation at 8000 × g for 20 min at 4 °C 
and resuspended in 40 ml of cold start buffer (20 mM sodium acetate, 
200 mM NaCl, and 0.01% Tween 20, pH 5.6). The cell suspension was 
disrupted in a French pressure cell (Aminco), and cellular debris was 
removed by ultracentrifugation for 25 min at 32,000 × g. 

Purification of soluble, active PAI-1 was performed as described 
previously (27). PAI-l-containing supernatants were injected onto a 
XK-26 column (Pharmacia Biotech Inc.) packed with CM-50 Sephadex 
(Pharmacia). The column was washed with 5 column volumes of start 
buffer (20 mM sodium acetate, 200 mM NaCl, and 0.01% Tween 20, pH 
5.6), and PAI-1 proteins were eluted using a 0.2-1.8 M linear gradient of 
NaCl in the same buffer. Peak fractions were collected, pooled, and 
concentrated using a Centriplus 30 concentrator (Amicon). Purified 
preparations were analyzed by activity measurements using standard, 
direct assays of t-PA, SDS-polyacrylamide gel electrophoresis, and 
measurement of optical density at 280 nm. 

Measurement of Active PAI-1 in Purified Preparations — A primary 
standard of trypsin was prepared by active site titration using p-nitrophenyl 
p'-guanidinobenzoate HCl as described previously (28). Concentrations 
of active molecules in purified preparations of wild type or 
mutated PAI-1s were determined by titration of standardized trypsin as 
described by Olson et al. (29) and by titration of standardized t-PA 
preparations. 

Kinetic Analysis of the Inhibition of t-PA and u-PA by Wild Type and 
Mutated Variants of PAI-1 — Second order rate constants (kᵢ) for inhibition 
of t-PA or u-PA were determined using pseudo-first order (kᵢ < 2 × 
10⁶) or second order (kᵢ > 2 × 10⁶) conditions. For each reaction, the 
concentrations of enzyme and inhibitor were chosen to yield several 
data points for which the residual enzymatic activity varied between 20 
and 80% of the initial activity. Reaction conditions and data analysis for 
pseudo-first order reactions were as described previously (30-33). 
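
As a generic illustration of pseudo-first order analysis (the cited protocols are not reproduced here), inhibitor is held in large excess so that residual activity decays as exp(-k_obs t), and the second order rate constant is k_obs divided by the inhibitor concentration. The sketch below assumes this textbook treatment; the rate constant, inhibitor concentration, and time points are hypothetical values chosen so that residual activity stays between 20 and 80%, as the text requires.

```python
import math


def pseudo_first_order_ki(times_s, residual_fraction, inhibitor_molar):
    """Estimate k_i (M^-1 s^-1) from ln(residual activity) versus time.

    Fits ln(f) = -k_obs * t by least squares through the origin, then
    divides k_obs by the (excess) inhibitor concentration.
    """
    log_residual = [math.log(f) for f in residual_fraction]
    k_obs = -sum(t * y for t, y in zip(times_s, log_residual)) / sum(
        t * t for t in times_s
    )
    return k_obs / inhibitor_molar


# Hypothetical example values, not data from the study.
TRUE_KI = 9e4          # M^-1 s^-1
INHIBITOR = 45e-9      # M, assumed to be in large excess over enzyme
times = [60, 120, 180, 240, 300, 360]  # seconds
fractions = [math.exp(-TRUE_KI * INHIBITOR * t) for t in times]

ki_fit = pseudo_first_order_ki(times, fractions, INHIBITOR)
print(f"k_i = {ki_fit:.2e} M^-1 s^-1")
```

Restricting the data to the 20-80% window avoids points where the logarithm of residual activity is dominated by measurement error at either extreme.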

For second order reactions, equimolar concentrations of u-PA and 
PAI-1 were mixed directly in microtiter plate wells and preincubated at 
room temperature for periods of time varying from 0 to 30 min. Follow- 
ing preincubation the mixtures were quenched with an excess of neu- 
tralizing anti-PAI-1 antibody (generously provided by Dr. David 
Loskutoff, The Scripps Research Institute), and residual enzymatic 
activity was measured using a standard, indirect chromogenic assay. 
These indirect, chromogenic assays were compared with control reac- 
tions containing no PAI-1 or with reactions to which PAI-1 was added 
after preincubation and the addition of anti-PAI-1 antibody, plasminogen, 
and Spec PL. Data were analyzed by plotting the reciprocal of the 
residual enzyme concentration versus the time of preincubation. 
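
The reciprocal plot follows from the rate law for an irreversible bimolecular reaction run at equimolar concentrations, for which 1/[E] = 1/[E]0 + k_i t, so the slope of 1/[E] against preincubation time estimates k_i. A minimal sketch of that fit is given here, assuming residual enzyme concentrations in molar units and times in seconds. The function name and the synthetic data are illustrative assumptions and are not taken from the paper.

```python
def second_order_rate_constant(times_s, residual_enzyme_molar):
    """Estimate k_i (M^-1 s^-1) as the least-squares slope of 1/[E] versus time.

    Assumes equimolar enzyme and inhibitor reacting irreversibly, so that
    1/[E] = 1/[E]0 + k_i * t.
    """
    if len(times_s) != len(residual_enzyme_molar) or len(times_s) < 2:
        raise ValueError("need at least two paired (time, concentration) points")
    reciprocals = [1.0 / conc for conc in residual_enzyme_molar]
    n = len(times_s)
    mean_t = sum(times_s) / n
    mean_y = sum(reciprocals) / n
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_s, reciprocals))
    return sxy / sxx


# Synthetic example: every value below is hypothetical, chosen only to exercise the fit.
E0_MOLAR = 1.0e-9                        # starting enzyme (and inhibitor) concentration
K_TRUE = 1.0e7                           # rate constant used to generate the data
TIMES_S = [0, 60, 300, 600, 1200, 1800]  # preincubation times, 0 to 30 min
RESIDUAL = [E0_MOLAR / (1.0 + K_TRUE * E0_MOLAR * t) for t in TIMES_S]

k_fit = second_order_rate_constant(TIMES_S, RESIDUAL)
print(f"fitted k_i = {k_fit:.3g} M^-1 s^-1")
```

With real data the points would scatter about the line, and the text's restriction to residual activities between 20 and 80% would determine which time points enter the fit.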

RESULTS AND DISCUSSION 

Construction of Substrate Phage Libraries — A polyvalent fd 
phage library that displayed random hexapeptide sequences 
and contained 2 X 10^ independent recombinants was prepared 
(18, 19). Each member of this library displayed an N-terminal 
extension from phage coat protein III containing a randomized 
region of six amino acids followed by a six-residue linker sequence 
(SSGGSG) and the epitopes for mAb 179 and mAb 3-E7. 
Because neither t-PA nor u-PA digests the protein III sequence, 
the antibody epitopes, or the flexible linker sequence, the loss 
of antibody epitopes from the phage surface upon incubation 
with either enzyme requires cleavage of the randomized pep- 
tide insert. Incubation of the library with t-PA, followed by 
removal of phage retaining the antibody epitopes, therefore, 
accomplishes the enrichment of phage clones whose random 
hexamer sequence can be cleaved by t-PA. 

Construction of Substrate Subtraction Libraries — The initial 
phage library was subjected to three rounds of high stringency 
selection with t-PA to assure the preparation of an intermedi- 
ate library that is highly enriched for phage that are efficient 
substrates of t-PA (Fig. 1). This intermediate library was then 
digested at low stringency with u-PA to remove phage that are 
moderate or good substrates for u-PA. Substrate subtraction 
was accomplished after the protease digestion of phage by 
adding mAb 3-E7 and immobilized protein A to the reaction 
mixture and precipitating the ternary complexes that contain 
the undigested phage. By contrast to all earlier selections, the 
phage remaining in solution were discarded, and the precipi- 
tate containing the ternary complexes was resuspended. Phage 
that were preferentially cleaved by t-PA were then released 
from the ternary complexes by digestion with t-PA. 
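
The logic of the three steps (positive selection with t-PA, subtraction with u-PA, release with t-PA) can be sketched as a sequence of filters. This is a toy model only: the 0 to 1 "lability" scores, the two thresholds, and the class and function names are hypothetical, and a real selection is a matter of digestion time and enzyme concentration rather than a hard cutoff. The hexamers are those named in the legend to Fig. 2.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PhageClone:
    hexamer: str
    tpa_lability: float  # hypothetical 0-1 score for cleavage by t-PA
    upa_lability: float  # hypothetical 0-1 score for cleavage by u-PA


def is_cleaved(lability, threshold):
    """A clone loses its antibody epitopes when its lability meets the threshold."""
    return lability >= threshold


def substrate_subtraction(library, high_stringency=0.7, low_stringency=0.3):
    # Positive selection: only phage cleaved by t-PA under high stringency
    # escape antibody precipitation and are carried forward.
    intermediate = [c for c in library if is_cleaved(c.tpa_lability, high_stringency)]
    # Subtraction: a low stringency u-PA digest cleaves even moderate u-PA
    # substrates; phage that still carry their epitopes are precipitated and kept.
    precipitated = [c for c in intermediate
                    if not is_cleaved(c.upa_lability, low_stringency)]
    # Release: t-PA digestion frees the t-PA-labile phage from the complexes.
    return [c for c in precipitated if is_cleaved(c.tpa_lability, high_stringency)]


# Scores are invented for illustration; only the hexamer sequences come from the text.
LIBRARY = [
    PhageClone("FRGRAA", 0.90, 0.10),  # t-PA-selective (phage 7)
    PhageClone("RIARRA", 0.90, 0.80),  # cleaved by both enzymes (phage 51)
    PhageClone("RSANAI", 0.10, 0.90),  # u-PA-selective
    PhageClone("PGRVVG", 0.05, 0.10),  # plasminogen control, cleaved by neither
]

selected = [clone.hexamer for clone in substrate_subtraction(LIBRARY)]
print(selected)
```

The design point the sketch makes is that the subtraction step keeps the precipitate rather than the supernatant, which is the reverse of every earlier round.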

Identification and Kinetic Characterization of t-PA-selective 
Substrates — Using the protocol outlined in Fig. 1 and a previously 
described (18, 19) sensitive dot blot assay (Fig. 2), we 
isolated and functionally verified 37 t-PA-selective phage 
clones that contained 32 distinct substrate sequences (Fig. 3). 
As illustrated by the experiment depicted in Fig. 2, the dot blot 
assay can rapidly provide information regarding both the ac- 
tivity and specificity of individual substrate phage clones. 
Based on the results of these assays, three peptide substrates 
(II-IV) containing hexamer sequences present in individual 







Fig. 1. Outline of the protocol used to create substrate sub- 
traction libraries. The gene III fusion protein, phage, monoclonal 
antibodies, and immobilized protein A are not drawn to scale. 

[Dot blot image: columns are phage PL, 7, 35, and 51; rows are no enzyme, t-PA, u-PA + amiloride, and u-PA.] 

Fig. 2. Functional analysis of individual control or substrate 
phage stocks using a dot blot assay that has been previously 
described (18, 19). Loss of positive staining indicated removal of 
antibody epitopes from the phage by proteolytic cleavage of the random 
hexamer region. Control phage PL contained the P3-P3' region of the 
actual target sequence present in plasminogen (PGRVVG) and was not 
digested by either enzyme under the conditions used in this experiment. 
Substrate phage 51 was isolated from the intermediate library used to 
create the t-PA substrate subtraction library, contained the hexamer 
RIARRA, and was an efficient substrate of both t-PA and u-PA. Phage 
7 was a member of the t-PA subtraction library, contained the hexamer 
FRGRAA, and, as predicted, was a t-PA-selective substrate. Phage 33 
was isolated from a conventional u-PA substrate library, contained the 
hexamer RSANAI, and was a u-PA-selective substrate. 

members of the substrate subtraction library were synthesized 
and characterized to provide a quantitative analysis of the 
properties of putative t-PA-selective substrates. These peptides 
were cleaved 180-1500 times more efficiently by t-PA than 
a control peptide (I) containing the physiological cleavage site 
present in plasminogen (Table I). In addition, by contrast to the 
plasminogen-derived, control peptide, which was cleaved 3-fold 
more efficiently by u-PA than by t-PA, the three selected pep- 
tides were cleaved 13-47 times more efficiently by t-PA than by 
u-PA. 

The t-PA/u-PA selectivity possessed by members of the sub- 
strate subtraction library was significantly greater than that 
displayed by members of an "optimized" substrate library con- 
structed using standard substrate phage display methods. We 






[Fig. 3 alignment: columns are Clone, P5, P4, P3, P2, P1, P1'. Of the sequence rows, only clone 32 (T R V R A K) is legible.] 

Fig. 3. Primary sequence of the randomized hexamer found in 
phage that were selective substrates of t-PA. Peptide sequences 
have been shifted to the left or right to align corresponding subsites of 
each individual hexamer sequence. Amino acids in parentheses are 
flanking residues from the gene III fusion protein. 



have previously presented detailed, kinetic analysis of the 
cleavage by t-PA of four substrates from such an optimized 
library (18). Subsequent analysis, using the dot blot assay, 
revealed that these four selected substrates were also cleaved 
efficiently by u-PA and therefore were not highly selective for 
t-PA versus u-PA. In fact, one of these substrates, SRARKA, 
was actually cleaved more rapidly by u-PA than by t-PA. The 
remaining three selected substrates were cleaved slightly more 
rapidly in the dot blot assay by t-PA than by u-PA. To deter- 
mine the precise extent of selectivity toward t-PA exhibited by 
these substrates, we measured the catalytic efficiency of both 
t-PA and u-PA for hydrolysis of peptides containing these three 
selected amino acid sequences (Table I, peptides V-VII). In 
contrast to the 13-47-fold t-PA/u-PA selectivity exhibited by 
substrates isolated from the subtraction library, the t-PA/u-PA 
selectivity of these substrates identified using standard substrate 
phage display was 2.5-4.9-fold. These data verify the 
value of the subtraction library and indicate that, in this in- 
stance, a single subtraction step resulted in the recovery of 
substrates whose selectivity was enhanced by approximately 
10-fold. 
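
The selectivity figures quoted here are ratios of catalytic efficiencies, (k_cat/K_m for t-PA) divided by (k_cat/K_m for u-PA). As a check on the arithmetic, the sketch below recomputes them from the k_cat/K_m values listed in Table I for peptides II-VII. The dictionary layout and function name are illustrative choices, not part of the original analysis.

```python
# k_cat/K_m in M^-1 s^-1 as (t-PA, u-PA), taken from Table I.
SUBTRACTION_LIBRARY = {"II": (430, 9.2), "III": (52, 4.0), "IV": (322, 15)}
STANDARD_LIBRARY = {"V": (200, 80), "VI": (220, 45), "VII": (1500, 320)}


def tpa_upa_selectivity(efficiency_tpa, efficiency_upa):
    """Ratio of catalytic efficiencies; values above 1 favor t-PA."""
    return efficiency_tpa / efficiency_upa


def selectivity_range(peptides):
    """Return (lowest, highest) t-PA/u-PA selectivity in a set of peptides."""
    ratios = [tpa_upa_selectivity(tpa, upa) for tpa, upa in peptides.values()]
    return min(ratios), max(ratios)


sub_low, sub_high = selectivity_range(SUBTRACTION_LIBRARY)
std_low, std_high = selectivity_range(STANDARD_LIBRARY)
print(f"subtraction library: {sub_low:.0f}-{sub_high:.0f}-fold")
print(f"standard library:    {std_low:.1f}-{std_high:.1f}-fold")
```

Comparing the upper ends of the two ranges (about 47 against about 4.9) gives the roughly 10-fold enhancement cited in the text.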

Comparison of Consensus Sequences Derived Using Sub- 
strate Phage Display and Substrate Subtraction Methods — We 
examined the consensus sequences derived using both standard 
and subtractive phage display to identify determinants of 
substrates that mediated t-PA/u-PA selectivity. Using standard 
substrate phage display protocols, we inferred the consensus 
sequence GR ↓ XA, where X could be a number of different 
amino acids but most often was arginine, for optimal substrates 
of t-PA (18). Strong conservation of particular amino acids at 
the P3 position of substrates was not observed in these exper- 
iments; however, slightly more than half (57%) of the selected 
substrates did contain one of five hydrophobic residues (Leu, 
Ile, Val, Phe, Tyr) at this position. Similarly, no obvious conservation 
of any particular amino acid was observed at the P4 



position of the optimized substrates, although the majority 
(60%) contained one of four small residues (Ser, Thr, Gly, Ala) 
at this position (18). 

To assure that members of the substrate subtraction library 
were highly active, as well as unusually specific, toward t-PA, 
the substrate subtraction library was first subjected to high 
stringency selection using t-PA. Consequently, it is not surprising 
that substrates obtained from the subtraction library agree 
well with the GR ↓ XA consensus sequence observed in the 
optimized library. Significant differences were observed, how- 
ever, when comparing residues found at the P3 and P4 posi- 
tions of substrates from the two libraries. 100% of the sub- 
strates from the subtraction library contained a large residue 
at P3, and 97% of the sequences contained either arginine or a 
large hydrophobic residue at this position. Arginine was the 
most frequently observed individual residue at P3 in the sub- 
traction library, occurring in approximately 38% of the selected 
sequences. In addition, while 60% of the sequences in the 
optimized library contained small residues at P4, 75% of the 
sequences from the subtraction library contained a large resi- 
due at this position. Comparison of these consensus sequences, 
therefore, suggested that the P3 and P4 residues were key 
determinants of enhanced t-PA/u-PA selectivity, with prefer- 
ences for arginine at P3 and large hydrophobic residues at P4. 

Important Distinctions between Substrate Phage Display and 
Substrate Subtraction Techniques — A key distinction between 
substrate subtraction libraries and substrate phage libraries 
obtained using conventional protocols is that the former are 
actively influenced by both positive and negative determinants 
of specificity. While substrate phage display identifies the most 
labile substrates for a particular enzyme, substrate subtraction 
identifies the most selective substrates for the enzyme. 

With enzymes for which the most labile substrates are also 
selective substrates (e.g. u-PA), the two protocols may yield 
similar results. Even in this situation, however, both tech- 
niques remain useful because it seems unlikely that the stand- 
ard protocols will yield the most selective substrate or that 
substrate subtraction will yield the most labile substrate. In 
contrast, with enzymes like t-PA, where the most labile sub- 
strates are not also highly selective substrates, the two meth- 
ods produce distinct results. For example, the most labile sub- 
strate sequence identified for t-PA, PFGRSA, was isolated by 
standard substrate display protocols and not by substrate sub- 
traction. This sequence was also cleaved efficiently by u-PA 
($k_{cat}/K_m$ = 320 M$^{-1}$ s$^{-1}$) and therefore exhibited only 4.7-fold 
t-PA/u-PA selectivity. This observation emphasizes the impor- 
tance of subtle determinants of substrate specificity and the 
ability of subtraction libraries to identify combinations of 
amino acids that maximize selectivity. The most selective 
P4-P1 sequence, FRGR (Table I, peptide X), occurred in the 
substrate subtraction library but not in the substrate phage 
library, although the latter protocol included one additional 
round of screening and approximately 40% more positive 
phage. 

For reasons described above, substrate subtraction tech- 
niques greatly facilitate elucidation of subsite occupancy that 
contributes primarily to the specificity, rather than to the catalytic 
efficiency, of substrate hydrolysis. Data presented in this 
study suggest that appropriate occupancy of P3 for t-PA sub- 
strates is one example of this type of interaction. In physiolog- 
ical contexts, such subtle specificity determinants may be es- 
sential. For example, Erythrina trypsin inhibitor inhibits t-PA 
but not u-PA. Although the molecular basis of this specificity 






Table I 

Comparison of $k_{cat}$ and $k_{cat}/K_m$ for the hydrolysis by t-PA or u-PA of peptides selected using substrate subtraction or standard 
substrate phage display protocols 

                                  t-PA                           u-PA
Substrate  Sequence^a             kcat     Km       kcat/Km      kcat     Km       kcat/Km      t-PA/u-PA
                                  (s^-1)   (µM)     (M^-1 s^-1)  (s^-1)   (µM)     (M^-1 s^-1)  selectivity

I          KKSPGR ↓ VVGGSVAH      0.0043   15,000   0.29         0.003    3400     0.88         0.33

Peptides from substrate subtraction library
II         LGGSGQRGR ↓ KALE       0.99     2300     430          0.02     2180     9.2          47
III        LGGSGERAR ↓ GALE       0.073    1410     52           0.004    970      4.0          13
IV         LGGSGHYGR ↓ SGLE       1.29     4010     322          0.059    3800     15           21

Peptides from standard substrate phage library
V          GGSGWLGR ↓ RGLVPE      2.0      10,000   200          0.32     4000     80           2.5
VI         GGSGYIGR ↓ RGLVPE      1.6      7300     220          0.20     4400     45           4.9
VII        GSGPFGR ↓ SALVPE       3.3      2200     1500         0.71     2200     320          4.7

Minimized peptides from substrate subtraction library
VIII       YGR ↓ S                23.7     6000     3950         2.6      11,400   230          17
IX         RGR ↓ K                15.3     16,600   922          0.76     46,500   16           57
X          FRGR ↓ K               12.2     9800     1240         0.14     8600     16           78

^a Positional nomenclature of subsite residues. Arrows denote the position of peptide bond hydrolysis. The peptide bond is cleaved between P1 and 
P1'. The S.E. in these determinations was 4-22%. 



remains obscure, it is intriguing that the Erythrina inhibitor 
contains an arginine in the P3 position (34). 
Although standard substrate phage display protocols can be 
used successfully to isolate selective substrates, this task will 
frequently prove arduous due to the necessity of isolating, 
sequencing, and functionally characterizing an inordinately 
large number of phage. We find, however, that a straightfor- 
ward modification of these protocols to include a subtraction 
step substantially enhances the efficiency of this process and 
therefore significantly increases the rate at which highly selec- 
tive substrates can be discovered. For example, we performed 
two independent substrate phage display experiments with 
t-PA using standard protocols. Characterization by dot blot 
analysis of a total of 105 individual phage clones isolated in the 
final round of selection in the two experiments revealed that 
either 6.7% (first experiment) or 4.2% (second experiment) of 
these phage clones were t-PA-selective substrates. In contrast, 
characterization of 71 phage clones from a substrate subtrac- 
tion library indicated that 54.9% of these phage were t-PA- 
selective substrates. In other words, depending on the protocol 
being used, we were forced to isolate, prepare, and character- 
ize, on average, either 15-24 clones (standard substrate dis- 
play) or 1.8 clones (substrate subtraction) to obtain a single 
t-PA-selective phage. 
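
The clone counts in the preceding sentence are the reciprocals of the observed hit rates: if a fraction f of clones is t-PA-selective, about 1/f clones must be characterized, on average, per selective clone found. A short sketch of that arithmetic, using the percentages reported above (the function name is an illustrative choice):

```python
def clones_per_selective_hit(selective_fraction):
    """Average number of clones to characterize per selective clone found."""
    if not 0.0 < selective_fraction <= 1.0:
        raise ValueError("fraction must be in the interval (0, 1]")
    return 1.0 / selective_fraction


HIT_RATES = {
    "standard display, first experiment": 0.067,
    "standard display, second experiment": 0.042,
    "substrate subtraction": 0.549,
}

for protocol, fraction in HIT_RATES.items():
    print(f"{protocol}: {clones_per_selective_hit(fraction):.1f} clones per hit")
```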

Minimization of the Selective Peptide Substrates — The ki- 
netic analysis described above was performed using substrate 
peptides that were 14 amino acids in length. To confirm that 
the specificity we observed was inherent in the selected 
hexapeptide sequences, we examined the kinetics of cleavage of 
short peptides containing only sequences found within selected 
hexapeptide sequences. Tetrapeptide IX and pentapeptide X, 
for example, were cleaved 57 or 78 times, respectively, more 
efficiently by t-PA than by u-PA and were therefore actually 
more selective than the 14-mer peptides (II-IV). Furthermore, 
tetrapeptide IX and pentapeptide X were significantly more 
labile substrates of t-PA than any of the 14-mer peptides (II- 
IV). Thus, compared with the 14-mers, the tetra- and pentapep- 
tides not only maintained specificity but also acquired in- 
creased activity. These data confirm the proposed status of the 
P3 and P4 residues as specificity determinants for substrates of 
t-PA and u-PA, suggest a particularly prominent role of the P3 
residue in this capacity, and demonstrate that highly selective 
substrates of t-PA can be created by appropriate occupancy of 
the P3-P1' subsites alone. These observations may also provide 
a firm basis for the rational design of highly selective, small 
molecule inhibitors of t-PA. Although hydrolysis of the selective, 
small peptide substrates by t-PA is characterized by $K_m$ 
values in the millimolar range, it has been routinely observed 
that the introduction of a transition state bond geometry adjacent 
to the P1 residue of a protease substrate can create either 
a reversible inhibitor whose affinity for the target protease is 
enhanced by 3-6 orders of magnitude or an irreversible inhib- 
itor with an impressive second order rate constant for inhibi- 
tion of the target protease (>10° s~^) (for a review, see Ref. 
35). Similar results using the substrates identified in this study 
would create highly selective t-PA inhibitors, with affinities in 
the low nanomolar range, that might be further improved by 
subsequent, systematic chemical modification. 
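
As a rough order-of-magnitude check on this estimate, one can assume (an assumption not stated in the text) that the $K_m$ of a minimized substrate approximates its dissociation constant. The $K_m$ values listed in Table I for peptides VIII-X with t-PA are about 6-17 mM, that is, of order $10^{-2}$ M, so a gain in affinity of 3-6 orders of magnitude would give:

```latex
K_i \;\approx\; \frac{K_m}{10^{3}\ \text{to}\ 10^{6}},
\qquad
K_m \approx 10^{-2}\ \mathrm{M}
\;\Longrightarrow\;
K_i \approx 10^{-5}\ \mathrm{M}\ \text{to}\ 10^{-8}\ \mathrm{M}
```

Under that assumption, only the upper end of the enhancement range reaches the nanomolar regime mentioned in the text.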

Design and Characterization of Variants of PAI-1 That Are 
Selective for t-PA — To test the prediction, based on analysis of 
the cleavage of peptide substrates, that the P3 residue can 
mediate the ability of an inhibitor to discriminate between t-PA 
and u-PA, we performed site-specific mutagenesis of PAI-1, the 
primary physiological inhibitor of both t-PA and u-PA. Three 
variants of PAI-1 were produced and characterized: a variant 
in which the P3 serine was converted to an arginine residue, a 
variant in which the P4 valine was replaced by a glutamine 
residue, and a double mutant containing both of these substi- 
tutions. Kinetic analysis of the inhibition of t-PA and u-PA by 
these variants of PAI-1 proved consistent with conclusions 
drawn from the previous experiments utilizing peptide substrates. 
The second order rate constants for inhibition of t-PA 
and u-PA by wild type PAI-1 were $1.6 \times 10^{6}$ M$^{-1}$ s$^{-1}$ and 
$1.9 \times 10^{7}$ M$^{-1}$ s$^{-1}$, respectively (Table II). Thus, wild type PAI-1 
exhibits approximately 11.9-fold specificity toward u-PA. By 
contrast, the second order rate constants for inhibition of t-PA 
and u-PA by the P3 arginine variant of PAI-1 were, respectively, 
$1.4 \times 10^{6}$ M$^{-1}$ s$^{-1}$ and $1.0 \times 10^{5}$ M$^{-1}$ s$^{-1}$, an approximately 
170-fold reversal in specificity (Table II). This large 
alteration in specificity was achieved without sacrificing activ- 
ity toward the target enzyme; the P3 arginine mutation re- 
duced activity of PAI-1 toward u-PA by a factor of approxi- 
mately 190 without significantly affecting reactivity toward 
t-PA. 

Individual mutation of the P4 valine of wild type PAI-1 to a 
glutamine residue had no effect on the rate of inhibition of 
either t-PA or u-PA. As suggested by the predominance in the 
subtraction library of substrates containing both large P3 and 
large P4 residues, however, the P4 glutamine mutation did 






[Table II: rows are wild type PAI-1, PAI-1/P3R, PAI-1/P4Q, and PAI-1/P4Q,P3R; the tabulated values are not legible.] 



enhance the t-PA selectivity of the P3 arginine variant of 
PAI-1. The second order rate constants for the inhibition of 
t-PA and u-PA by the P4 glutamine, P3 arginine double mutant 
of PAI-1 were $1.4 \times 10^{6}$ M$^{-1}$ s$^{-1}$ and $2.9 \times 10^{4}$ M$^{-1}$ s$^{-1}$, 
respectively (Table II). While maintaining full activity toward 
t-PA, then, the double mutant exhibited an approximately 600-fold 
enhanced t-PA/u-PA selectivity compared with wild type 
PAI-1 and a 3.5-fold greater t-PA selectivity than the P3 arginine 
variant of PAI-1. The absolute t-PA/u-PA selectivity of 
wild type PAI-1, the P3 Arg single mutant, and the P3 Arg, P4 
Gln double mutant was approximately 0.08, 14, and 48, 
respectively. 
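
The selectivities quoted for the three inhibitors are ratios of second order rate constants (t-PA over u-PA). The sketch below recomputes them. The rate constants are entered with exponents that are assumed here so as to reproduce the ratios stated in the text (0.08, 14, and 48); the dictionary keys and function name are illustrative.

```python
# Second order rate constants in M^-1 s^-1 as (t-PA, u-PA).
# Exponents are assumptions chosen to be consistent with the quoted ratios.
RATE_CONSTANTS = {
    "wild type": (1.6e6, 1.9e7),
    "P3 Arg": (1.4e6, 1.0e5),
    "P3 Arg, P4 Gln": (1.4e6, 2.9e4),
}


def tpa_selectivity(k_tpa, k_upa):
    """t-PA/u-PA selectivity of an inhibitor; values above 1 favor t-PA."""
    return k_tpa / k_upa


selectivity = {name: tpa_selectivity(*ks) for name, ks in RATE_CONSTANTS.items()}
fold_vs_wild_type = {
    name: value / selectivity["wild type"] for name, value in selectivity.items()
}

for name in RATE_CONSTANTS:
    print(f"{name}: selectivity {selectivity[name]:.2g}, "
          f"{fold_vs_wild_type[name]:.0f}-fold relative to wild type")
```

On these numbers the double mutant comes out a little under 600-fold more t-PA-selective than wild type PAI-1, in line with the rounded figure given in the text.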

Contribution of Structural Studies to Understanding Re- 
stricted Specificity at P2 and the Critical Role of the P3 and P4 
Residues in Mediating Specificity— At the time these studies 
were initiated, no structural information was available for the 
protease domain of either t-PA or u-PA. During preparation of 
this manuscript, however, both of these important structures 
were described (2, 3). The t-PA structure contained a benzamidine 
molecule, which occupied the S1 pocket, and the u-PA 
structure included a covalently bound, tripeptide chloromethyl 
ketone that filled the S1, S2, and S3 sites. These structural 
reports have provided a clear molecular basis for our observa- 
tion that both t-PA and u-PA display a strong preference for 
glycine at the P2 subsite of substrates. In both enzymes, the 
side chain of residue 99 (chymotrypsin numbering), a tyrosine 
in t-PA and a histidine in u-PA, severely restricts the size of the 
S2 pocket. Modeling of even an alanine residue into this subsite 
produces a slight steric conflict with residue 99 of the enzymes, 
and larger residues at P2 create extensive clashes. 

Another important observation of the structural studies is 
that the major distinction between the substrate binding cleft 
of t-PA and u-PA occurs in the region corresponding to the aryl 
binding site of thrombin (2, 3, 36). In u-PA this pocket is 
partially filled by an insertion of two amino acids (threonine 
97A, leucine 97B; chymotrypsin numbering) that is absent in 
t-PA. Consequently, the aryl binding site is significantly larger 
in t-PA than in u-PA. In addition, unlike Ser^'''* of u-PA, Arg"" 
of t-PA extends toward, and partially occupies, the aryl binding 
site. However, because Arg^^" appears highly flexible and mo- 
bile (2), the extent to which this residue actually influences the 
aryl binding site of t-PA in solution remains uncertain. 

Depending on the precise binding mode, the aryl binding site 
can interact with any of several residues of a particular sub- 
strate or inhibitor. If the substrate or inhibitor adopts a canon- 
ical conformation within the active site cleft, the P4 residue 
will occupy the aryl binding site (37). However, the most ex- 
tensively studied aryl binding site, that of thrombin, interacts 
with the P9 residue of the physiological substrate fibrinogen 
(38). In addition, the P2 glycine of substrates selected in this 
study, with its greatly expanded range of allowed conforma- 
tions compared with other natural amino acids, could allow the 
P3 residue to interact with the aryl binding site. Consequently, 
the structural studies indicating that the major differences 
between the active site cleft of the two proteases occur in the 
aryl binding site are consistent with our substrate phage studies, 
which demonstrate that the P3 and P4 residues are the 
primary determinants of the ability of a substrate to distin- 
guish t-PA and u-PA. 

Differences at position 217, a leucine in t-PA and an arginine 
in u-PA, may also contribute to specificity differences observed 
in this study for the two enzymes. In the u-PA structure, Arg$^{217}$ 
adopts an unusual conformation that allows formation of a salt 
bridge with the P3 glutamic acid of the bound inhibitor. 
Clearly, a similar juxtaposition of Arg$^{217}$ and the P3 arginine 
residue of a substrate or inhibitor would create an unfavorable 
electrostatic interaction that would be absent when the same 
substrate or inhibitor interacted with t-PA. However, if Arg$^{217}$ 
of u-PA adopted a more commonly observed conformation, this 
residue would extend into solvent and could be located at a 
significant distance from the P3 arginine residue. Thus, the 
extent to which Arg$^{217}$ of u-PA and Leu$^{217}$ of t-PA contribute to 
specificity distinctions between the two enzymes remains an 
open question. 

As discussed above, insights gained from this study and from 
very recent structural studies do allow the identification of 
candidate residues that may mediate important specificity distinctions 
between t-PA and u-PA. The absence of key structural 
information regarding the binding mode(s) of a particular selective 
substrate or inhibitor to both t-PA and u-PA, however, 
precludes a definitive description at atomic resolution of the 
mechanisms by which the specificity observed in our studies 
has been achieved. 

Limitations of Structural Studies and Contribution of Sub- 
strate Phage and Substrate Subtraction Techniques to Under- 
standing Protease Specificity — It will not be possible to under- 
stand enzyme catalysis and specificity without extensive 
structural information. On the other hand, it is very unlikely 
that the recent structural studies alone could have been used to 
predict the most selective substrates of either t-PA or u-PA. 
One difficulty described above is that the structural data for 
the proteases does not necessarily provide information regard- 
ing the precise binding mode and backbone conformation of an 
individual substrate, a critical detail that determines which, if 
any, region of the substrate will interact with specific regions 
{e.g. the aryl binding site) of the enzyme. 

Another limitation is that the static structures do not neces- 
sarily provide information regarding the role of molecular dy- 
namics in enzyme catalysis and specificity. Consequently, con- 
tributions to substrate specificity arising from subtle 
differences in dynamic properties of related enzymes may not 
be detected by the x-ray structures. The development of a 
detailed, molecular understanding of enzyme mechanism will 
therefore require information from studies that utilize a wide 
variety of techniques of molecular biology, biochemistry, and 
biophysics. Substrate phage display and substrate subtraction 
libraries can contribute to these efforts by providing a power- 
ful, combinatorial approach to the identification of key deter- 
minants of substrate reactivity and specificity. 

Conclusion — The rational design of small molecule inhibitors 
as therapeutic agents is often complicated by the necessity of 
discriminating between closely related enzymes (39). We demonstrate 
here that appropriate selections of substrate phage 
can achieve this discrimination. Substrate subtraction libraries 
are likely to provide substrates that can distinguish between 
any two distinct proteases, and there is no theoretical reason 
why multiple proteases could not be used in the subtraction 
step to achieve even greater specificity. Moreover, it should be 
possible to prepare both substrate and substrate subtraction 
libraries as described above for any enzymes that can use 
peptides or proteins as substrates. These techniques might be 
adapted to protein kinases, for example, by using antibodies 
against phosphoserine, phosphothreonine, or phosphotyrosine 
during the selection of substrate phage. Consequently, the construction 
and characterization of substrate and substrate subtraction 
libraries may make substantial contributions to the rational 
design of highly specific, small molecule inhibitors of 
selected enzymes, a problem of paramount importance during 
the development of new therapeutic agents. In addition, by revealing 
specificity determinants that might otherwise remain 
obscure, these libraries will provide key insights into the molecular 
basis of specificity for a variety of important enzymes. 

Acknowledgments — We thank Drs. Guy Salvesen, Steve Kent, Dave 
Loskutoff, and Mark Ginsberg for stimulating discussions and critical 
review of this manuscript. We thank Lynn Mayfield, Bikash C. 
Pramanik, and Dr. Clive Slaughter for peptide synthesis and mass 
spectral analysis and Karen Barker for initial kinetic analysis of the 
peptide substrates. 

REFERENCES 

1. Perona, J. J., and Craik, C. S. (1995) Protein Sci. 4, 337-360 
2. Lamba, D., Bauer, M., Huber, R., Fischer, S., Rudolph, R., Kohnert, U., and Bode, W. (1996) J. Mol. Biol. 258, 117-135 
3. Spraggon, G., Phillips, C., Nowak, U. K., Ponting, C. P., Saunders, D., Dobson, C. M., Stuart, D. I., and Jones, E. Y. (1995) Structure 3, 681-691 
4. Collen, D., and Lijnen, H. R. (1991) Blood 78, 3114-3124 
5. Madison, E. L. (1994) Fibrinolysis 8, Suppl. 1, 221-236 
6. Ganu, V. S., and Shaw, E. (1982) Peptide Protein Res. 20, 421-428 
7. Madison, E. L., Coombs, G. S., and Corey, D. R. (1995) J. Biol. Chem. 270, 7558-7562 
8. Coombs, G. S., Dang, A. T., Madison, E. L., and Corey, D. R. (1996) J. Biol. Chem. 271, 4461-4467 
9. Carmeliet, P., Schoonjans, L., Kieckens, L., Ream, B., Degen, J., Bronson, R., De Vos, R., van den Oord, J. J., Collen, D., and Mulligan, R. C. (1994) Nature 368, 419-424 
10. Danø, K., Andreasen, P. A., Grøndahl-Hansen, J., Kristensen, P., Nielsen, L. S., and Skriver, L. (1985) Adv. Cancer Res. 44, 139-266 
11. Tsirka, S. E., Gualandris, A., Amaral, D. G., and Strickland, S. (1995) Nature 
12. Fukudome, K., and Esmon, C. T. (1994) J. Biol. Chem. 269, 26486-26491 
13. Min, H. Y., Doyle, L. V., Vitt, C. R., Zandonella, C. L., Stratton-Thomas, J. R., Shuman, M. A., and Rosenberg, S. (1996) Cancer Res. 56, 2428-2433 
14. Seeds, N. W., Williams, B. L., and Bickford, P. C. (1995) Science 270, 1992-1994 
15. Ossowski, L. (1988) Cell 52, 321-328 
16. Shapiro, R. L., Duquette, J. G., Roses, D. F., Nunes, I., Harris, M. N., Kamino, H., Wilson, E. L., and Rifkin, D. B. (1996) Cancer Res. 56, 3597-3604 
17. Matthews, D. J., and Wells, J. A. (1993) Science 260, 1113-1117 
18. Ding, L., Coombs, G. S., Strandberg, L., Navre, M., Corey, D. R., and Madison, E. L. (1995) Proc. Natl. Acad. Sci. U. S. A. 92, 7627-7631 
19. Smith, M. M., Shi, L., and Navre, M. (1995) J. Biol. Chem. 270, 6440-6449 
20. Jameson, G. W., Roberts, D. V., Adams, R. W., Kyle, S. A., and Elmore, D. T. (1973) Biochem. J. 131, 107-117 
21. Urano, T., Urano, S., and Castellino, F. J. (1988) Biochem. Biophys. Res. Commun. 150, 45-51 
22. Alpert, A. J. (1990) J. Chromatogr. 499, 177-196 
23. Taylor, J. R. (1982) An Introduction to Error Analysis: The Study of Uncertainties in Physical Measurements, University Science Books, Mill Valley, CA 
24. Tucker, H. M., Mottonen, J., Goldsmith, E. J., and Gerard, R. D. (1995) Nat. Struct. Biol. 2, 442-445 
25. Zoller, M. J., and Smith, M. (1984) DNA 3, 479-488 
26. Kunkel, T. A. (1985) Proc. Natl. Acad. Sci. U. S. A. 82, 488-492 
27. Sancho, E., Tonge, D. W., Hockney, R. C., and Booth, N. A. (1994) Eur. J. Biochem. 224, 125-134 
28. Chase, T., and Shaw, E. (1967) Biochem. Biophys. Res. Commun. 29, 508-514 
29. Olson, S. T., Bock, P. E., Kvassman, J., Shore, J. D., Lawrence, D. A., Ginsburg, D., and Björk, I. (1995) J. Biol. Chem. 270, 30007-30017 
30. Holmes, W. E., Lijnen, H. R., and Collen, D. (1987) Biochemistry 26, 5133-5140 
31. Beatty, K., Bieth, J., and Travis, J. (1980) J. Biol. Chem. 255, 3931-3934 
32. Madison, E. L., Goldsmith, E. J., Gerard, R. D., Gething, M. J., Sambrook, J. F., and Bassel-Duby, R. S. (1990) Proc. Natl. Acad. Sci. U. S. A. 87, 3530-3533 
33. Madison, E. L., and Sambrook, J. F. (1993) Methods Enzymol. 223, 249-271 
34. Teixeira, A. V., Dowdle, E. B. D., and Botes, D. P. (1994) Biochim. Biophys. Acta 1217, 23-28 
35. Baggio, R., Shi, Y.-Q., Wu, Y.-Q., and Abeles, R. H. (1996) Biochemistry 35, 3351-3353 
36. Bode, W., Turk, D., and Sturzebecher, J. (1990) Eur. J. Biochem. 193, 175-183 
37. Bode, W., Turk, D., and Karshikov, A. (1992) Protein Sci. 1, 426-471 
38. Ni, F., Meinwald, Y. C., Vasquez, M., and Scheraga, H. A. (1989) Biochemistry 28, 3094-3105 
39. Crooke, S. T. (1996) Nat. Biotechnol. 14, 238-241 



